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NEW POLYPEPTIDES AND THEIR USE FOR THE RESCUE OF FUSION 
DEFECTIVE VIRUS OR RETROVIRUS GLYCOPROTEINS 

The invention relates to new polypeptides, and their use for the rescue of fusion 
5 defective virus or retrovirus glycoproteins. 

Retroviruses enter cells following their attachment to specific cell surface receptors. 
This function is mediated by the envelope glycoproteins expressed as trimeric complexes on 
the viral particles (Hunter, 1997). Interaction with the receptor is thought to cause a 
conformational change of the glycoprotein structure necessary to expose a "fusion peptide", 

10 buried within the native envelope complex and involved in the actual membrane fusion 
process (Gerlier, 1998). On the basis of both genetic (Battini et al., 1995; Battini et al., 
1992; Morgan et al., 1993; Ott and Rein, 1992) and structural (Fass et al., 1997; Fass et 
al., 1996) evidence, the attachment and fusion functions of murine leukemia virus (MLV) 
envelopes have been separated on the two subunits of the envelope monomer. Thus it is 

15 generally considered that the ammo-terminal half of the MLV SU (surface) subunit is 
responsible for the binding to the receptor whereas the ectodomain of the TM 
(transmembrane) subunit contains the "fusion machinery" composed of an amino-terminal 
fusion peptide associated to a coiled-coil structure that rearrange upon receptor interaction. 
How retrovirus envelope glycoproteins convert binding to the receptor to a signal 

20 which activates the fusion machinery is currently unknown. Ample evidence from the 
literature indicate that functional inter-relations between different domains of the envelope 
complex are essential to promote fusion activation (Lavillette et al., 1998; Rein et al., i998; 
Zhao et al., 1997; Zhao et al., 1998). 

Retroviruses can be involved in two different types of membrane fusion : 

25 - a fusion between viral and cell membranes triggered by the interaction of the virion 
envelope glycoproteins with specific cell surface molecules, which are the viral receptors 
(virus-cell fusion), with said fusion being responsible for retroviral infection, 
- a fusion between two cell membranes triggered by the interaction of the envelope 
glycoproteins expressed at the surface of a cell and the viral receptors expressed on a 

30 neighbouring cell (cell-cell fusion), the visible outcome of this latter being the formation of 
cell syncytia containing up to 100 nuclei (also known as polykaryocytes or multinucleated 
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giant cells); syncytium-formation results in the death of the cells which make up the 
syncytium. 

Furthermore, cleavage of the R peptide (inhibitory peptide on the cytoplasmic tail of 
the TM subunit of the envelope glycoprotein) is required to activate fully the fusogenic 
5 potential of these envelope glycoproteins and mutants lacking the R peptide show greatly 
enhanced activity in cell fusion assays (Rein et al, J. Virol., 1994, 68, 1773-1781; Ragheb 
& Anderson, J. ViroL, 1994, 68, 3220-3231; Brody et al, J. Virol., 1994, 68, 4620-4627). 

Furthermore, it is known that a vector directing the expression on a eukaryotic cell 
surface of a syncytium-inducing polypeptide can be used in gene therapy of malignant 
10 diseases, as a result of selective elimination of unwanted cells, through syncytia formation 
(WO 98/40492). 

The fusogenicity of envelope glycoproteins has been enhanced as described in 
Lavillette et al. (1998) and internal application WO 99/36561. The proline-rich region 
(PRR) of amphotropic MLV envelope glycoprotein was shown to be an essential 
15 determinant of membrane fusion activation and specific PRR-mutated envelope 
glycoproteins were obtained and were characterised by an increased cell-cell or virus-cell 
fusogenicity. 

Studies on Mo-MLV envelope protein GP70 have shown that the first eight amino 
acids of the mature gp70 protein are essential for fusion and virus entry. Within this region, 
20 a particularly important residue is H8. This residue belongs to the PHQ motif which is 
conserved among all MLVs and some other retroviruses, such as a gibbon ape leukemia 
virus SEATO and feline leukemia virus types A, B, and C. The amphotropic counterpart of 
Del2-8, which has the first six amino acids (Met-Ala-Glu-Ser-Pro-His) deleted has been 
constructed. Vectors bearing the deletion mutant envelope protein could not transduce NIH 
25 3T3 or HeLa cells (Bae et aL, Journal of Virology, March 1997, p. 2092-2099). 

As the fusion mechanisms induced by interaction of retrovirus envelope 
glycoproteins and receptors of cell membrane are partially unknown, the regulation of the 
fusogenicity of retroviral or viral envelope glycoproteins was up to day not possible. 

One of the aims of the invention is to enable fusion control of envelope glycoproteins 
30 with target cells. 

It is an aim of the invention to provide means for the rescue of the fusion property of 
fusion defective viral or retroviral envelope glycoproteins. 
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It is another aim of the invention to provide a process for restoring the fusogenic 
property of retroviral or viral envelope glycoproteins, the fusogenic property of which has 
been inactivated and for controlling their activation by external means. 

It is another aim of the invention to provide a process in which envelope 
5 glycoproteins are in a first stage inactivated as to their fusion property and then in a second 
stage activated in a controlled and regulated way, by means of activating polypeptides, 
which possibly present targeting capacities, and which are able to restore efficiently the 
fusogenic and cytotoxic properties of said envelope glycoproteins. 

It is another aim of the invention to provide polypeptides liable to activate the cell- 
10 cell fiisogenicity or the virus-cell fusogenicity. 

It is another aim of the invention to provide polypeptides liable to activate the virus- 
cell fusogenicity and which thus can be used in strategies involving retroviral vectors 
comprising envelope glycoproteins, the fusion property of which can be activated. 

It is another aim of the invention to provide fusogenic envelope glycoproteins 
15 presenting cytotoxic properties for use in the treatment of malignant diseases, for which the 
fusogenic property could be controlled by means of activating polypeptides. 

It is another aim of the invention to provide cytotoxic fusogenic envelope 
glycoproteins having cellular specificity and for which the fusogenic property can be 
controlled by means of activating polypeptides. 
20 All these aspects are achieved through the present invention, which in one of its 

broad embodiments is the use of at least one activating polypeptide comprising an amino 
acid sequence corresponding to : 

- the amino acid sequence of the surface subunit (SU) of an envelope glycoprotein of a first 
virus, or of a first retrovirus, or 

25 - an amino acid sequence derived from said surface subunit, or 

- a fragment of an amino acid sequence of said surface subunit, encompassing the receptor 
binding domain (RBD) and the amino terminal part of the surface subunit (SU), 

for the rescue of the fusion property of a fusion defective envelope glycoprotein, which has 
the property of attachment to target cells and is inactive with respect to the fusion with said 
30 target cells, said fusion defective envelope glycoprotein belonging to a second virus or a 
second retrovirus of the same type as or of a different type from the above designated "first 
virus or first retrovirus". 
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Thus the invention lies in the possibility of desactivating the inactivation of the 
fusogenic property of an envelope glycoprotein, said envelope glycoprotein having 
previously been made inactive with respect to fusion. This restoration of the fusion property 
of the envelope glycoprotein is unexpectedly achieved by polypeptides whose common 
5 structure is the presence of the receptor binding domain (RBD) and the amino terminal part 
of SU in the environment of cells expressing fusion defective envelope glycoproteins and of 
cells expressing retroviral or viral receptors and the interaction of which will lead to the 
formation of syncytia, or in the environment of retroviruses incorporating fusion defective 
envelope glycoproteins and of cells expressing retroviral or viral receptors and the 

10 interaction of which will lead to infection. 

The expression "polypeptide comprising an amino acid sequence corresponding to", 
means that the polypeptide has the same amino sequence as that of the surface subunit, or as 
that of an aminoacid sequence derived from the surface subunit or of a fragment and can be 
prepared by recombinant technique or by chemical synthesis. 

15 The expression "polypeptide comprising an amino acid sequence derived from said 

surface subunit" means that the polypeptide has an aminoacid sequence which is not strictly 
identical to that of the sequence of the surface subunits and may contain mutations such as 
deletion or substitution of aminoacids of the parental sequence or insertion of aminoacid 
sequence of other origins. 

20 The "amino terminal part of the surface subunit" is defined as the aminoacid 

sequence of the first aminoacids of a length of about 10 to about 15 aminoacids of the 
envelope surface subunit, not including the aminoacid sequence of the signal peptide. 

A fusion defective envelope glycoprotein derives from a parental envelope 
glycoprotein in which the fusogenic property has been inactivated, but which still presents 

25 the property of recognizing (attachment) target cells. 

The expression "inactive with respect to the fusion" means that in a test for the measure of 
the fusogenic activity, the fusion index is either mil or significantly reduced when the fusion 
assay is allowed to proceed for about 20 minutes. 

The rescue of the fusion property can be measured according to one of the following 

30 tests : 

- test of formation of syncytia : expressing vectors encoding the envelope glycoprotein to be 
tested are transfected into cells on day 0. On day 2, the transfected cells are overlaid by 
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indicator cells which express the cell surface receptors corresponding to the tested envelope 
glycoprotein. Detection of syncytia (i.e. fusion of envelope-expressing cells with 
neighbouring indicator cells) is performed at day 3 or day 4 and fusion indexes are 
measured as previously described (Lavillette et al, 1998) by counting the number of 
5 syncytia and the number of nuclei outside and inside the syncytia. 

- test of infection of target cells : retroviral vectors carrying a lac Z transgene are being 
generated with the tested envelope glycoproteins. Serial dilutions of the vectors are applied 
on target cells at day 0. Infected cells are stained at day 2 with X-Gal to reveal the number 
of cell colonies having integrated the lac Z-vectors and titers are determined according to 

10 standard procedures (Lavillette at al., 1998). 

It is to be noted that the activating polypeptide corresponds to the amino acid 
sequence of the SU of an envelope glycoprotein of a first virus or retrovirus and that the 
fusion defective envelope glycoprotein belongs to a second virus or retrovirus of the same 
type or of a type different from the first virus or retrovirus. This means that there are in fact 

15 two possibilities : 

- either the first and second viruses are of the same type, which means that the activating 
polypeptide and the fusion defective envelope glycoprotein are homologous, 

- or the first and second viruses are of a different type, which means that the activating 
polypeptide and the fusion defective envelope glycoprotein are heterologous. 

20 According to an advantageous embodiment the fusion defective envelope 

glycoprotein involved are those derived from retroviruses. 

Retroviral envelope glycoproteins allow receptor binding of the viral particles onto 
which they have been incorporated during virion assembly. This preliminary attachment on 
the viral receptor activates a second function of the retroviral envelope glycoproteins which 

25 induces the fusion between the viral and the target cell membranes. While for type C 
mammalian retroviruses the determinants of receptor binding (RBD, receptor binding 
domain), are located at the ammo-terminal end of the SU envelope subunit, the molecular 
determinants that are responsible of membrane fusion are located in the TM subunit of the 
glycoprotein. The molecular determinants or events which are involved in the conversion of 

30 binding to the receptor into a signal which activates the TM fusion determinants are not 
precisely known although by mutagenesis analysis, they are believed to be present in all the 
SU subunit, in particular in the N-terminal end of the RBD itself (Bae et al., 1997; 
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Lavillette et al., 2000), in tbe proline-rich region (Lavillette et al., 1998) and in the 
carboxy-terminal domain of the SU (Nussbaum et al., 1993; Pinter et al., 1997). Thus the 
control of the activation of the fusion machinery harbored by the TM subunit might be 
achieved by introducing specific mutations that affect the fusion-activation determinant of 
5 the SU. This invention describes several mutations in the SU which can inhibit or partially 
inhibit the SU determinants which control activation of the TM fusion-machinery. It is 
important to understand that such mutations do not impair the fusogenic potential of the 
mutant envelope glycoproteins, but only impair the molecular determinants that control the 
activation of the fusion machinery. The latter one is still intact, despite the mutations 

10 introduced in the SU, but is not activated upon receptor binding. In addition the invention 
describes the use of soluble polypeptides that can alleviate the effect of these mutations and 
hence couple the fusion machinery to a device that is provided exogenously in substitution 
to the normal fusion control determinants which are disrupted in the mutant envelope 
glycoproteins. Thus the addition of the "activating" polypeptides can restore or rescue the 

15 fusogenicity of the mutant envelope glycoproteins. 

Several mutations or modifications of the envelope glycoprotein can be used to 
reversibly inhibit or reduce their fusogenicity. 

A first method is the introduction of the delH mutation that removes an histidine 
from the PHQ conserved motif located at the ammo-terminal end of the SU envelope 

20 subunit of type C mammalian retroviruses such as MLV-A, MoMLV and GALV (mutations 
H5del, H8del or H9del, respectively). Such a mutation induces a drastic effect in the 
capacity of the receptor binding domain (RBD) of these envelope glycoproteins to triggers 
further events required to promote membrane fusion. 

A second method consists in the replacement of the SU C-terminal domain (C 

25 domain) with an heterologous C domain derived from the envelope glycoprotein of another 
type C mammalian retrovirus. For example; the chimeric CMO amphotropic glycoprotein 
(hereafter defined) can be constructed by combining the amphotropic MLV RBD and PRR 
(proline-rich region) with the ecotropic MoMLV C-terminal domain (example 4, Fig. 29A 
and 29B). For such chimeras, there is a low compatibility between their RBD (amphotropic 

30 origin) and C domain (ecotropic origin), which significantly impairs the kinetic of 
membrane fusion. 
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A third method consists in the replacement of the RBD by a ligand such as a 
cytokine or a single chain antibody targeted to a cell surface specific receptor. In contrast to 
the RBD of the SU of a type C mammalian retrovirus, such ligands do not contain the 
determinants that allow further activation of the Env fusogenicity. Such chimeric envelope 
5 glycoproteins therefore resembles a delH envelope glycoprotein which would carry an RBD 
that is targeted to a specific receptor. 

This invention relates to a method which can alleviate the inhibitory effect of these 
different mutations and thus restore the fusogenic properties of the mutant envelope 
glycoproteins. This method uses preparations of polypeptides which are derived from the 

10 RBD of different type C mammalian retrovirus envelope glycoproteins. For example, the 
addition of an RBD which is compatible with the C domain of a fusion-defective envelope 
glycoprotein will fully restore its fusogenic properties and will activate the kinetic of 
membrane fusion. Similarly such a peptide will also restore the fusogenicity of envelope 
glycoproteins that harbor a delH mutation or, alternatively, which carry a cell-type specific 

15 ligand in place of the RBD. 

This method can be applied to the activation of infectivity of retroviral vectors whose 
envelope glycoproteins carry one of these fusion-defective mutations. Depending on the 
presence of a ligand in the fusion-deficient envelope glycoprotein or, alternatively, in the 
activating polypeptide, control of infection to specific target cell types will therefore be 

20 obtained. The invention can also be applied to the control of the hyper-fusogenic activity of 
retroviral envelope glycoproteins that can induce cell-cell fusion and elimination of 
unwanted cells by syncytia formation. Indeed while most retroviral envelope glycoproteins 
are not truly fusogenic when expressed at the surface of the cells that produce the retroviral 
particles, they acquire their fusion competency during or after assembly and release of the 

25 viral particles. Yet there is considerable interest to convert the retroviral envelope 
glycoproteins into hyperfusogenic glycoproteins that can fuse cells in the close vicinity of 
the cells into which they are expressed. In addition to the introduction of the above- 
mentioned mutations that can control activation of their fusogenicity, the fusogenicity of 
retroviral envelope glycoproteins can be stimulated by several ways to obtain the best effect 

30 upon activation with an exogenous activating polypeptide. Thus, additional mutations will 
be introduced into the retroviral envelope glycoprotein, such as the truncation of the 
cytoplasmic tail (Rless mutation) and/or the modification of their proline-rich regions. 
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According to an advantageous embodiment, the activating polypeptide used in the 
invention recognizes substantially the same receptors of the target cells as the fusion 
defective envelope glycoprotein. 

According to another advantageous embodiment, the activating polypeptide used in 
5 the invention recognizes receptors of the target cells which are different from the ones 
recognized by the fusion defective envelope glycoprotein. 

It is thus quite unexpected that the activating polypeptides can restore the 
fusogenicity of fusion defective envelope glycoproteins by interacting with retroviral 
receptors which are expressed at the surface of the same target cells, and which are different 
10 from the receptors recognized by the fusion defective envelope glycoprotein. 

According to another advantageous embodiment of the invention, the activating 
polypeptide and/or the fusion defective envelope glycoprotein is (are) linked to a ligand, 
which is specific with respect to some receptors of the target cells, the ligand being 
preferably linked to the activating polypeptide. 
15 The addition of specific ligands on the activating polypeptides and/or on the fusion 

defective envelope glycoprotein enables to control the fusion activity of said fusion defective 
envelope glycoprotein and to increase the specificity with respect to target cells. Therefore, 
the design of a specific system to eliminate unwanted cells, may consist in a fusion defective 
envelope glycoprotein associated to an activating polypeptide that is linked to a ligand that 
20 can specifically recognize a receptor expressed on the target cells, whereas the activating 
polypeptide recognizes the same receptor or a different receptor expressed on the target 
cell. 

The link between the activating polypeptide and the ligand is preferably a covalent 
bond, whereby both the activating polypeptide and the ligand have been genetically fused 
25 and encoded by an expression vector as a single protein chain, and whereby the ligand is 
fused either to the ammo-terminus or to the carboxy terminus of the activating polypeptide. 

According to another advantageous embodiment, the ligand is chosen among 
molecules liable to recognize receptors of target cells, for instance receptors of tumoral 
cells, such as EGF, FGF, VEGF, or liable to recognize epitope of proteins expressed by the 
30 target cells, such as recombinant monoclonal antibody. 

According to another advantageous embodiment, the activating polypeptide is under 
the form of a product as such, or is expressed by a host, such as a virus or a cell, or is 



WO 00/71578 



9 



PCT/EP00/04534 



expressed, in particular by the target cells, transformed by nucleotide sequences coding for 
said activating polypeptide. 

In the use of the invention, one resorts either directly to chemical substances (which 
is the meaning of activating polypeptide under the form of a product as such), or to host 
(virus or cell) expressing said activating polypeptide. 

In the following of the text, the expression "soluble activating polypeptides" will be 
sometimes used, meaning that said polypeptides are not integrated in the envelope proteins 
of a virus. 

The capacity of the activating polypeptides to rescue the infectivity of fusion 
defective envelope glycoproteins is similar whether the target cells i) constitutively express 
and secrete the activating polypeptides, ii) are co-incubated with both fusion defective 
envelope glycoprotein and said activating polypeptides iii) are pre-incubated with said 
activating polypeptides before adding the fusion defective envelope glycoprotein, or iv) are 
pre-incubated with the fusion defective envelope glycoprotein before adding the activating 
polypeptides. 

In an advantageous use of the invention the fusion defective envelope glycoprotein is 
expressed by a host, such as a virus incorporating in its envelope said fusion defective 
envelope glycoprotein, or such as a cell, transformed by nucleotide sequences coding for 
said fusion defective envelope glycoprotein. 

According to the invention, the activating polypeptide advantageously corresponds 
to a fragment of about 200 to about 250 amino acids of said surface subunit comprising the 
receptor binding domain (RBD) and comprising the first amino acids of the amino terminal 
part of the surface subunit, the number of said first aminoacids varying from about 10 to 
about 15, in particular comprising the RBD and the PHQ peptide, in particular the 
fragments corresponding to amino acid sequences derived from type C mammalian 
retroviruses such as: 
- Murine leukemia virus such as : 

. amphotropic 

. ecotropic 

. xenotropic 

. polytropic 

. 10 Al MLV 
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- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B, 
FELV-C 

in particular wherein the activating polypeptide corresponds to one of the following amino 
acid sequences : 

5 

A-RBD 
E-RBD 
GALV-RBD 
34-1-A-RBD 
10 34-1-E-RBD 

34-1-GALV-RBD 
EGF-A-RBD 
EGF-E-RBD 
EGF GALV-RBD 

15 - A-RBD corresponds to the activating polypeptide comprising the RBD domain and the 
PHQV motif of the amphotropic MLV surface subunit. 

- E-RBD corresponds to the activating polypeptide comprising the RBD domain and the 
PHQV motif of the ecotropic MLV surface subunit. 

- GALV-RBD corresponds to the activating polypeptide comprising the RBD domain and 
20 the PHQP motif of the GALV surface subunit. 

- 34.1-A-RBD corresponds to the activating polypeptide A-RBD above defined, linked to a 
ligand which is a single chain antibody directed against MHCI. 

- 34.1-E-RBD corresponds to the activating polypeptide E-RBD above defined, linked to a 
ligand which is a single chain antibody directed against MHCI. 

25 - 34.1 GALV-RBD corresponds to the activating polypeptide GALV-RBD above defined, 
linked to a ligand which is a single chain antibody directed against MHCI. 

- EGF-A-RBD corresponds to the activating polypeptide A-RBD above defined, linked to a 
ligand which is the epidermal growth factor. 

- EFG-E-RBD corresponds to the activating polypeptide E-RBD above defined, linked to a 
30 ligand which is the epidermal growth factor. 

- EGF-GALV-RBD corresponds to the activating polypeptide GALV-RBD above defined, 
linked to a ligand which is the epidermal growth factor. 
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According to the invention, the fusion defective envelope glycoprotein 
advantageously corresponds to the amino acid sequence of a mutated form of an envelope 
glycoprotein of a virus or a retrovirus which, in its non mutated form, presents attachment 
and fusion properties with respect to target cells, said fusion defective envelope 
5 glycoprotein comprising one or several mutations in the amino terminal part of its surface 
subunit, particularly in the first amino acids of the amino terminal part of the surface 
subunit, the number of said first aminoacids varying from about 10 to about 15, said 
mutations inactivating the fusion property without substantially affecting the attachment 
property, in particular comprises one or several mutations within the PHQ amino sequence, 
10 wherein in particular the fusion defective envelope glycoprotein is derived from type C 
mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 

15 . xenotropic 

. polytropic 
. 10A1MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B, 
FELV-C, 

20 in particular wherein the fusion defective envelope glycoprotein corresponds to one of the 
following amino acid sequences : 
-MO del H 

- AdelH 

- PROMOdelH 
25 - GALVdelH 

- C2delH 

-MO del HR less 

- AdelHRless 

- PROMOdelHRless 
30 - GALVdelHRless 

- C2delHRless 
-CMO 
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-TM MO 
-CTMMO 
- C MOdelH 
-TMMOdelH 
5 -CTMMOdelh 
-34.1-A 

* MO del H corresponds to the ecotropic MLV envelope glycoprotein, in which the 
histidine residue of the PHQV motif is deleted (i.e. the 8th residue of the SU subunit is 

10 removed). 

* AdelH corresponds to the amphotropic MLV envelope glycoprotein, in which the histidine 
residue of the PHQV motif is deleted (i.e. the 5th residue of the SU subunit is removed). 

* PROMOdelH corresponds to the amphotropic MLV envelope glycoprotein, in which the 
proline rich region has been replaced by the proline rich region of the ecotropic MLV 

15 envelope and in which the histidine residue of the PHQV motif is deleted. 

* GALVdelH corresponds to the GALV envelope glycoprotein, (gibbon ape leukemia virus 
envelope glycoprotein), in which the histidine residue of the PHQP motif is deleted (i.e. the 
9th residue of the SU subunit is removed). 

* C2delH corresponds to the amphotropic MLV envelope glycoprotein, in which the proline 
20 rich region has been modified by a small deletion at its carboxy terminus and in which the 

histidine residue of the PHQV motif is deleted (i.e. the 5th residue of the SU subunit is 
removed). 

* MO del H R less corresponds to the ecotropic MLV envelope glycoprotein, in which the 
histidine residue of the PHQV motif is deleted and in which the R peptide of the TM 

25 subunit has been deleted. 

* AdelHRless corresponds to the amphotropic MLV envelope glycoprotein in which the 
histidine residue of the PHQV motif is deleted and in which the R peptide of the TM 
subunit has been deleted. 

* PROMOdelHRless corresponds to the amphotropic MLV envelope glycoprotein, in which 
30 the proline rich region has been replaced by the proline rich region of the ecotropic MLV 

envelope, in which the histidine residue of the PHQV motif is deleted and in which the R 
peptide of the TM subunit has been deleted. 
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* GALVdelHRless corresponds to the GALV envelope glycoprotein, in which the histidine 
residue of the PHQP motif is deleted and in which the R peptide of the TM subunit has been 
deleted. 

* C2delHRless corresponds to the amphotropic MLV envelope glycoprotein, in which the 
5 proline rich region has been modified by a small deletion at its carboxy terminus and in 

which the histidine residue of the PHQV motif is deleted (i.e. the 5th residue of the SU 
subunit is removed) and in which the R peptide of the TM subunit has been deleted. 

* CMO envelope corresponds to the amphotropic MLV envelope glycoprotein, in which the 
SU carboxy-terminal domain, C domain, has been replaced by the C domain derived from 

10 the ecotropic MoMLV envelope glycoprotein. 

* CMOdelH envelope corresponds to the amphotropic MLV envelope glycoprotein, in 
which the SU carboxy-terminal domain, C domain, has been replaced by the C domain 
derived from the ecotropic MoMLV envelope glycoprotein and in which the histidine 
residue of the PHQV motif is deleted (i.e. the 5th residue of the SU subunit is removed). 

15 * TMMO envelope corresponds to the amphotropic MLV envelope glycoprotein, in which 
the ectodomain of the TM subunit has been replaced by the TM ectodomain derived from 
the ecotropic MoMLV envelope glycoprotein. 

* TMMOdelH envelope corresponds to the amphotropic MLV envelope glycoprotein, in 
which the ectodomain of the TM subunit has been replaced by the TM ectodomain derived 

20 from the ecotropic MoMLV envelope glycoprotein and in which the histidine residue of the 
PHQV motif is deleted (i.e. the 5th residue of the SU subunit is removed). 

* CTMMO envelope corresponds to the amphotropic MLV envelope glycoprotein, in which 
both the SU carboxy-terminal domain, C domain, and the ectodomain of the TM subunit 
have been replaced by the C domain and TM ectodomain derived from the ecotropic 

25 MoMLV envelope glycoprotein. 

* CTMMOdelH envelope corresponds to the amphotropic MLV envelope glycoprotein, in 
which both the SU carboxy-terminal domain, C domain, and the ectodomain of the TM 
subunit have been replaced by the C domain and TM ectodomain derived from the ecotropic 
MoMLV envelope glycoprotein and in which the histidine residue of the PHQV motif is 

30 deleted (i.e. the 5th residue of the SU subunit is removed). 
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* 34.1-A envelope corresponds to the amphotropic MLV envelope glycoprotein, in which 
the receptor binding domain has been removed and replaced by an single chain antibody 
against human class I MHC molecules. 

5 The invention also relates to the use of at least one activating polypeptide and of at 

least one fusion defective envelope glycoprotein for the preparation of a drug useful for the 
treatment of cancer and infectious diseases. 

The invention also relates to products containing an activating polypeptide and a 
fusion defective envelope glycoprotein as defined above, as a combined preparation for 
10 simultaneous, separate, or sequential use for the rescue of the fusion defective property of 
said fusion defective envelope glycoprotein. 

The invention also relates to products containing an activating polypeptide and a 
fusion defective envelope glycoprotein as defined above, as a combined preparation for 
simultaneous, separate or sequential use for the treatment of cancer pathologies. 
15 The invention also relates, as new products, to activating polypeptides corresponding 

to a fragment of about 200 to about 250 amino acids of said surface subunit comprising the 
RBD and the PHQV peptide, in particular the fragments corresponding to amino acid 
sequences derived from type C mammalian retroviruses such as: 
- Murine leukemia virus such as : 
20 . amphotropic 

. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

25 - or GALV type retroviruses and related retrovirus such as SSAV, FELV-B 

in particular wherein the activating polypeptide corresponds to one of the following amino 

acid sequences : 

A-RBD 

E-RBD 
30 GALV-RBD 

34-1-A-RBD 

34-1-E-RBD 
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34-1-GALV-RBD 
EGF-A-RBD 
EGF-E-RBD 
EGF-GALV-RBD 

5 The invention also relates, as new products, to fusion defective envelope 

glycoprotein to the amino sequence of an envelope glycoprotein of a virus or a retrovirus 
which in its native state presents attachment and fusion properties with respect to target 
cells, and comprises one or several mutations in the amino terminal part of its surface 
subunit, inactivating the fusion property without substantially affecting the attachment 
10 property, in particular comprises one or several mutations within the PHQ amino sequence 
of the amino terminal part of its surface subunit, wherein in particular the fusion defective 
envelope glycoprotein is derived from type C mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
15 . ecotropic 

. xenotropic 
. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV-B or 
20 FELV-C 

in particular wherein the fusion defective envelope glycoprotein corresponds to one of the 

following amino acid sequences : 

-AdelH 

- PROMOdelH 
25 - GALVdelH 

- C2delH 

- AdelHRless 

- PROMOdelHRless 

- GALVdelHRless 
30 - C2delHRless 

-CMO 

- TM MO 
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- C TM MO 
-CMOdelH 

- TM MOdelH 

- CTM MOdelh 
5 -34.1-A 

with the proviso that they are different from the ecotropic MLV sequence, wherein the 
amino acids in position 2 to 8 are either substituted or deleted. 

The invention also relates to the use of at least one activating polypeptide to the 
rescue of the fusion property of a fusion defective envelope glycoprotein wherein the fusion 
10 defective envelope glycoprotein corresponds to the amino acid sequence of a mutated form 
of an envelope glycoprotein of a virus or a retrovirus which, in its non mutated form, 
presents attachment and fusion properties with respect to target cells, said fusion defective 
envelope glycoprotein being advantagously derived from type C mammalian retroviruses 
such as: 

15 - Murine leukemia virus such as : 

. amphotropic 

. ecotropic 

. xenotropic 

. polytropic 
20 . 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, FELV- 
B, FELV-C and comprising : 

* one or several mutations in the amino terminal part of its surface subunit, 
25 particulary in the first amino acids of the amino terminal part of the surface subunit, the 
number of said first aminoacids varying from about 10 to about 15, said mutations 
inactivating the fusion property without substantially affecting the attachment property, in 
particular comprises one or several mutations within the PHQ amino-terminal sequence, 
wherein in particular the fusion defective envelope glycoprotein corresponds to one of the 
30 following amino acid sequences : 

- MO R less del H 
-MOdelH 
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- A R less del H 

- AdelH 

- PROMO R less del H 

- PROMOdelH 

5 -GALVR less del H 

- GALVdelH 

- C2 R less del H 

- C2delH 

- C MOdelH 
10 - TM MOdelH 

-CTMMOdelh 

* or the replacement of the ammo-terminal domain of its surface subunit (RBD) by a 
ligand such as a growth factor, or a cytokine or a single chain antibody, die extent of said 
IS mutations inactivating the fusion property and the attachement property to the normal viral 
receptor, in particular wherein the fusion defective envelope glycoprotein corresponds to the 
following amino-acid sequence : 

-34.1-A 

20 * or the replacement of its SU carboxy-terminal domain and / or its TM ectodomain, 

and preferably its SU carboxy-terminal domain, by the corresponding SU carboxy-terminal 
domain and / or the TM ectodomain of another "second" glycoprotein derived from type C 
mammalian retroviruses, with the proviso that said fusion defective glycoprotein and said 
second glycoprotein are not derived from the same type C mammalian retroviruses, in 

25 particular wherein the fusion defective envelope glycoprotein corresponds to one of the 
following amino-acid sequences : 
-CMO 

- TM MO 

- C TM MO 



30 
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FIGURE LEGENDS 

Fig 1A, IB and 1C represent titres of AdelH retroviruses in the presence of soluble 
envelope fragments of the invention. Retroviral vectors carrying VSV-G or/and wild-type 
5 amphotropic, A, envelope glycoproteins were used as controls. Titres expressed as lacZ 
infectious units (i.u.) per ml of viral supernatants. In figures 1A, IB and 1C, the titres are 
on the y-axis. 

Fig 1A. Different target cell types (XC, NIH3T3, CHO-PIT-2, TE 671, from left to right) 
were incubated with both lacZ retroviruses and conditioned media containing, or not 
10 (control), the indicated polypeptides during the 3 hrs of infection. The black columns 
correspond to the control, the grey columns correspond to A-RBD and the hatched columns 
correspond to SU polypeptide. 

Fig IB. XC target cells were incubated with dilutions of A-RBD (x axis) and infected with 
LacZ retroviruses carrying the AdelH envelope glycoproteins (curve with black circles), A 
15 envelope glycoproteins (curve with black squares) or VSV-G envelope glycoproteins (curve 
with hollow squares). 

Fig 1C. Retroviral vectors carrying A envelope glycoproteins (curves with black squares) 
or AdelH envelope glycoproteins (curves with black circles) were incubated with XC target 
cells for 1 hr at 4°C to allow virion binding while preventing cell entry. After PBS washing 

20 to remove unbound retroviruses, cells were incubated at 37°C with undiluted A-RBD- 
containing supernatants for the indicated times of dilution represented on the x axis, the 
titres being expressed on the y axis. A-RBD was then eliminated from the cell supernatant 
by washing the cells four times with 1 ml PBS (resulting in dilution of unbound A-RBD by 
more than 100,000 times). Cells were then grown in regular medium for 2 days before X- 

25 Gal staining. 

Fig 2. represents the titers of A-RBD-mediated activation of AdelH retroviruses. A- 
RBD (used undiluted) was bound to XC target cells at 37°C for 30 min. Unbound A-RBD 
was removed and cells were washed two times with PBS and incubated at 37°C. At the 
30 indicated times, cells were further washed two times with PBS and incubated at 37°C for 3 
hrs with retroviruses carrying A envelope glycoproteins or AdelH envelope glycoproteins. 
Titers, determined 2 days later by X-Gal staining, are expressed as lacZ i.u./ml (y-axis), as 
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a function of time before infection (x-axis). The curve with black circles corresponds to 
retrovirus carrying A envelope glycoproteins and the curve with black squares corresponds 
to retrovirus carrying AdelH envelope glycoproteins. 

5 Fig 3. represents titers of AdelH retroviruses after ultrafiltration. Titers expressed on 
the y-axis (as lacZ i.u./ml) of retroviruses carrying wild-type, A, or AdelH envelope 
glycoproteins before (1) or after incubation with A-RBD (2), and following A-RBD elution 
on two successive 700 KD-cut off ultrafiltration cartridges (3). After elution AdelH 
retroviruses were subjected to re-stimulation with fresh A-RBD (4). 

10 

fig. 4A and 4B represent infection assays of AdelH and MOdelH retroviruses in the 
presence of RBD or RBDdelH polypeptides. 

Fig. 4A. Retroviruses carrying A (upper left) or AdelH (upper right) envelope 
glycoproteins were mixed with A-RBD (hatched column) or A-RBDdelH (grey column) 
15 during infection of XC cells. Retroviruses carrying MO (lower left) or MOdelH (lower 
right) envelope glycoproteins were mixed with E-RBD (hatched column) or E-RBDdelH 
(grey column) during infection of XC cells. Titers are expressed as lacZ i.u./ml on the y- 
axis. In Figure 4A, the black column corresponds to control. 

Fig. 4B. represents binding assays of A-RBD (solid lines) and A-RBDdelH fragments 
20 (broken lines) on Cearl3 (top), and binding assays of E-RBD (solid lines) and E-RBDdelH 
fragments (broken lines) on Cearl3 cells (bottom). The background of fluorescence (filled 
histograms) was provided by incubating the cells with supernatants devoid of envelope 
fragments. 

Fig 5A and SB represent the cross-activation of AdelH and MOdelH retroviruses by 
25 MLVRBDs. 

Fig. 5A. Titers of retroviruses carrying AdelH or MOdelH envelope glycoproteins in the 
presence of the indicated RBDs. Titers are determined (y-axis) on XC target cells as lacZ 
i.u./ml. On figure 5A, the black column corresponds to control, the hatched column 
corresponds to A-RBD and the grey column corresponds to E-RBD. 
30 Fig. SB. Titers of AdelH retroviruses (y-axis) on CHO cells expressing, or not (control), 
PiT-2 and/or mCAT-1 MLV receptors in the presence of the indicated RBDs. On figure 5B, 
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the hatched column corresponds to E-RBD, the grey column corresponds to A-RBD and the 

black column corresponds to control. 

Fig, 6 represents the A-RBD sequence 

Fig, 7 represents the E-RBD sequence 
5 Fig. 8 represents the GALV-RBD sequence 

Fig. 9 represents the EGF-A-RBD sequence 

Fig. 10 represents the EGF-E-RBD sequence 

Fig. 11 represents the EGF-GALV-RBD sequence 

Fig. 12 represents the 34.1 -A-RBD sequence 
10 Fig. 13 represents the 34. 1-E-RBD sequence 

Fig. 14 represents the 34. 1 -GALV-RBD sequence 

Fig. 15 represents the AdelH sequence 

Fig. 16 represents the PROMOdelH sequence 

Fig. 17 represents the GALVdelH sequence 
15 Fig. 18 represents the C2delH sequence 

Fig. 19 represents the AdelHRless sequence 

Fig. 20 represents die PROMOdelHRless sequence 

Fig. 21 represents the GALVdelHRless sequence 

Fig. 22 represents the C2delHRless sequence 
20 Fig. 23 represents the C MO sequence 

Fig. 24 represents the C MO delH sequence 

Fig. 25 represents the TM MO sequence 

Fig. 26 represents the TM MOdelH sequence 

Fig. 27 represents the C TM MO sequence 
25 Fig. 28 represents the C TM MO delH sequence 

Fig. 29A represents the schematic structure of chimeric amphotropic envelope glycoproteins 

in which the C domain (CMO chimera), the TM ectodomain (TMMO chimera) or both C 

and TM domains (CTMMO chimera) are replaced by the ecotropic counterpart derived 

from MoMLV envelope glycoprotein. The CMOdelH, TMMOdelH and CTMMOdelH 
30 chimeric envelope glycoproteins harbor the H5del mutation. 
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Figure 29B represents the kinetic of infection of retroviral vectors carrying the wild-type 
amphotropic envelope glycoprotein (A) or the CTMMO amphotropic chimera on XC cells 
in the presence, or not, of soluble polypeptides encompassing the amphotropic RBD (A- 
RBD) or the ecotropic RBD (E-RBD), as indicated. 
5 The curve with points and black circles corresponds CTMMO on XC cells + E-RBD 
The curve with points and hollow circles corresponds to A on XC cells + E-RBD 
The curve with continuous line and hollow circles corresponds to A on XC cells 
The curve with dotted line and hollow circles corresponds to A on XC cells 4- A-RBD 
The curve with continuous line and black circles corresponds to CTMMO on XC cells 
10 The curve with dotted line and black circles corresponds to CTMMO on XC cells 4- A- 
RBD 

The X axis corresponds to the time expressed in seconds. 
The Y axis corresponds to the number of infected cells. 
Fig, 30 represents the 34.1-A Sequence 

15 

Example 1: 

MATERIALS AND METHODS 

Cell lines. The TELCeB6 cell line (Cosset et al., 1995b) has been derived from the TELac2 
20 line after transfection and clonal selection of a Moloney murine leukemia virus (MoMLV)- 
based expression plasmid to produce Gag and Pol proteins. The TELac2 cells were 
originally derived from the TE671 human rhabdomyosarcoma cells (ATCC CRL8805) to 
express the nlsLacZ reporter retroviral vector (Takeuchi et al., 1994). Production of 
infectious retroviral particles by TELCeB6 cells depends on newly introduced envelope 
25 expression vectors. 

Cerd9 and Cearl3 cells (Kozak et al., 1995) and CHO-PiT-2 cells (Rodrigues and Heard, 
1999) were derived from CHO (Chinese hamster ovary) cells (ATCC CCL-61) and express 
ecotropic MLV receptors alone or both ecotropic and amphotropic receptors or amphotropic 
MLV receptors alone, respectively. 
30 XC-A-RBD cells were derived from XC rat sarcoma cells (ATCC CCL-165) after 
transfection with the pA-ST plasmid expressing the amino terminal receptor binding domain 
of the amphotropic envelope glycoprotein (Battini et al., 1996). 
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TE671 and TELCeB6 cells were grown in DMEM (Life-Technologies) supplemented with 
10% fetal bovine serum. NIH-3T3 mouse fibroblasts were grown in DMEM supplemented 
with 10% new-born calf serum. Cerd9, Cearl3, CHO-PiT-2 and CHO cells were grown in 
DMEM supplemented with 10% fetal bovine serum and with proline (Life-Technologies). 

5 

Construction of envelope expression vectors. Plasmids FBASALF and FBMOSALF 
carrying a phleomycin-resistance gene and encoding the MLV-4070A amphotropic (noted as 
A) and MoMLV ecotropic (noted as MO) envelope glycoproteins, respectively, have been 
described elsewhere (Cosset et al., 1995a) and were used as backbones for construction and 

10 expression of envelope mutants. All constructs were generated by PCR-mediated and 
oligonucleotide site-directed mutagenesis (details and sequences available upon request). 
The FBASALF plasmid was modified to produce a cell entry-defective form of the 
amphotropic glycoprotein, designated AdelH envelope, by deleting the 36th codon of the 
4070A env gene (Ott et al., 1990). The resulting mutant envelope glycoprotein, in which 

15 the 5th residue of the SU envelope subunit was removed, was named AdelH. The 
expression plasmid encoding the fusion-defective MOdelH envelope glycoproteins, 
harboring a deletion of the 8th residue of the SU envelope subunit corresponding to the 41th 
codon of the MoMLV env gene (Shinnick et al., 1981), was derived from FBMOSALF. 
Plasmids encoding soluble receptor binding domains (RBD) were derived from FBASALF 

20 and FBMOSALF expression vectors. The boundaries of either amphotropic (A-RBD) and 
ecotropic (E-RBD) RBDs, defined as A32244 and A34269, were fused in frame with the 
carboxy-terminal ends of either envelope glycoproteins by fusion to aminoacids R623 or 
R634, respectively. Residues are numbered starting from the initiation methionine deduced 
from the aminoacid sequences of the 4070A amphotropic MLV (Ott et al., 1990) or the 

25 Moloney MLV (Shinnick et al., 1981). Expression vectors encoding either A-RBDdelH or 
E-RBDdelH were generated similarly by using the plasmids expressing the AdelH or 
MOdelH envelope glycoproteins. In plasmids encoding A-RBD, E-RBD, A-RBDdelH or E- 
RBDdelH polypeptides, the 11 aminoacid-lohg VSV tag (YTDDEMNRLGK) (Kreis and 
Lodish, 1986) was inserted after either RBDs. 

30 

Transfections, production of RBDs and infection assays. Envelope glycoproteins 
expression plasmids were transfected into TELCeB6 as reported elsewhere (Cosset et al., 
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1995a). RBD expression vectors were transfected by calcium phosphate precipitation in 
NM-3T3, XC or TE671 cells. Transfected cells were selected with phleomycin (50 \iglM) 
and phleomycin-resistant colonies were pooled. 

RBD-containing supernatants were collected after an overnight production from confluent 
5 RBD-transfected cells, filtrated trough 0.45 micron pore-sized membranes and stored at 
4°C. Virus-containing supernatants were collected after an overnight production from 
freshly confluent env-transfected TELCeB6 cells in regular medium. 
The supernatant were filtered trough 0.45 micron pore-sized membranes and diluted in 
DMEM for titration assays. Target cells were seeded in 24 multi-well plates at a density of 

10 5xl0 4 cells per well and incubated overnight at 37°C. Unless otherwise indicated in figure 
legends, 200 |il of conditioned cell culture media containing the RBDs were added to the 
cells after removing their supernatants. Then 200 jj.1 of viral supernatant dilutions containing 
5 jig/ml polybrene were added and cells were incubated for 3 hrs at 37 °C. Cell supernatants 
were then removed and cells were incubated in regular medium for 48 hrs. X-Gal staining 

15 and viral titer determination were performed as previously described (Cosset et al., 1995a) 
and expressed as lacZ i.u. (infectious units)/ml. 

Antibodies. Anti-SU: a rat monoclonal antibody 83A25 (Evans et al., 1990) cell culture 
supernatant against MLV SU used undiluted for FACS analysis. Anti-VSV tag: a purified 
20 mouse monoclonal antibody P5D4 (SIGMA-Aldrich) used diluted 1/100 for FACS analysis. 

Binding assays and FACS analysis. For binding assays, target cells were washed in PBS 
and detached by a 10 min incubation at 37°C with versene 0.02% in PBS. Cells were 
washed in PBA (PBS with 2% FCS and 0.1% sodium azide). 5x10 s cells were incubated 

25 with 1 ml of conditioned supernatants containing the VSV tagged RBDs for 45 min at 37 °C. 
Cells were then washed with PBA and were incubated with the P5D4 antibody, for 45 min 
at 4°C. Cells were washed twice with PBA and incubated with anti-mouse FITC-conjugated 
antibodies (DAKO, U.K.), respectively. 5 min before the two final washes in PBA, cells 
were counter-stained with 20 ^g/ml propidium iodide. Fluorescence of living cells was 

30 analysed with a fluorescent-activated cell sorter (FACSCalibur, Beckton Dickinson). 
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RESULTS 

H5del phenotype can be efficiently compensated by soluble A-RBD. Viral particles were 
generated with either wild-type amphotropic A or AdelH envelope glycoproteins and used to 
infect target pells in the presence or in the absence of soluble SU or soluble A-RBD, an 
5 envelope fragment encompassing the amphotropic receptor binding domain. When SU or A- 
RBD were provided during infection the infectivity of retroviruses carrying wild-type 
amphotropic envelope glycoproteins was decreased by approximately 5-100 times (Fig. 1A), 
most likely owing to the partial blocking of PiT-2 receptors on the target cell surface. In 
striking contrast, while in the absence of envelope fragments retroviruses generated with 

10 AdelH envelopes were very poorly infectious, with titers in the range of lO^lO 2 iacZ i.u. 
(infectious units)/ml, the presence of SU or of A-RBD in the medium of infected cells could 
strongly stimulate the infectivity of the former retroviruses by up to 30,000 fold (Fig. 1A). 
Stimulation of AdelH infection with either soluble envelope fragments was detected on a 
wide range of target cell types including rat XC cells, 3T3 murine fibroblasts, TE671 

15 human cells and PiT-2-transfected CHO hamster cells, though with different efficacies (Fig. 
1A). Significant differences were detected when soluble SU or A-RBD were used to activate 
the AdelH retroviruses. Comparatively to soluble SU, A-RBD was 100 to 1,000 fold more 
potent to activate AdelH retroviruses (Fig. 1A). These results indicated that the 
determinants of activation were located in the first half of the SU protein. Therefore to 

20 facilitate the characterization of this activation pathway, the subsequent experiments were 
performed by using the soluble RBD. A-RBD could fully activate AdelH retroviruses, when 
diluted 10 fold and retained 90% of activity at a 1:100 dilution, thus suggesting that it was 
effective at very low doses (Fig. IB). In contrast the infectivity of control retroviruses 
pseudotyped with either wild-type amphotropic envelopes or with VSV-G glycoproteins was 

25 weakly increased or unchanged, respectively, when A-RBD was diluted (Fig. IB). 
Additionally the origin of the cell types used to produce the envelope fragments did not 
influence the stimulation of infectivity since conditioned media harvested from 3T3, TE671 
or XC cells that were engineered to express the envelope fragments could similarly activate 
the infectivity of AdelH retroviruses (data not shown). Finally activation of AdelH 

30 retroviruses was found to be the result of a specific interaction of A-RBD with the PiT-2 
amphotropic receptor. Indeed while A-RBD could efficiently rescue the infectivity of AdelH 
retroviruses on PiT-2-transfected CHO cells, no infection could be detected when using 
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parental CHO cells as target cells (see below and Fig. 5A). Altogether these data 
demonstrated that activation of AdelH retroviruses by A-RBD was specific. 
The capacity of the soluble envelope fragments to rescue the infectivity of AdelH 
retroviruses was similar whether the target cells i) constitutively expressed and secreted the 
5 envelope fragments, ii) were co-incubated with both AdelH retroviruses and A-RBD, iii) 
were pre-incubated with A-RBD prior to infection, or iv) were pre-incubated with the 
AdelH retroviruses before adding A-RBD (Table 1). Using this latter experimental 
condition, the kinetic of activation of infection was found very rapid. Indeed a brief 
incubation of target cells with A-RBD, by no more than 30 sec, was found sufficient to fully 

10 stimulate the infectivity of pre-bound AdelH retroviruses (Fig. 1C). 

A-RBD stimulates the target cell membrane rather than the AdelH envelope. In theory, 
it is possible that the envelope fragments might rescue the infectivity of AdelH retroviruses 
either by interacting with the fusion-defective viral particles or by activating the target cell 
membrane upon receptor binding. Indeed within the envelope complex, the SU subunits are 

15 not tightly attached to either the TM subunits or to the other SU units and can easily shed 
off the viral particles (Hunter and Swanstrom, 1990). However the reassociation of shed SU 
with the viral particle is unlikely since they are incorporated on virions by virtue of their 
association with the TM subunit that results from their synthesis as a common SU-TM 
polypeptide precursor (Hunter and Swanstrom, 1990). Therefore it is unlikely that soluble 

20 SU or A-RBD mixed with AdelH retroviruses may interact with the envelope complex of 
the latter retroviruses and thus rescue their fusion defect. Nevertheless the following 
experiments were performed to address this possibility. 

A-RBD was bound to target cells at 37°C for 30 min. After removal of unbound fragments 
by washing target cells, binding of A-RBD was verified by FACS analysis. Cells were then 

25 incubated at 37°C for 0 (TO) to 28 hrs (T28) to allow intemalisation of receptor/A-RBD 
complexes (Rodrigues and Heard, 1999) and A-RBD disappearance from the cell surface, as 
checked by FACS analysis (Data not shown). Cells were then washed again to remove A- 
RBD that may have been released by the cells and the AdelH retroviruses were then added 
at the different time points. Comparatively to infection performed at TO, no significant 

30 decrease of infectivity could be detected when the AdelH retroviruses were added until 8 
hrs after the initial binding with A-RBD. Likewise the infectivity of AdelH retroviruses 
added at T20 was at least 50 fold higher than before pre-stimulation of the cells with A- 
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RBD, thus indicating that stimulation of infectivity was durable (Fig. 2). Since A-RBD was 
efficiently internalised and removed from the cell surface after 4 hrs of incubation (Data not 
shown), these results indicated that activation of AdelH retroviruses most likely occurred by 
activation of the cell surface rather than by interaction with the AdelH envelope complex 
5 itself. 

Further evidence indicated that A-RBD could not associate with the AdelH retroviruses. 
Indeed retroviruses generated with either A or AdelH envelopes and mixed or co-expressed 
with A-RBD were not found to incorporate A-RBD as shown by western-blotting of purified 
viral particles with antibodies recognising A-RBD (data not shown). However these 

10 experiments could not exclude that undetected quantities of A-RBD might be associated with 
the virions. Therefore to rule out this possibility, a mixture of A-RBD and retroviruses 
carrying either AdelH or wild-type A envelope glycoproteins were separated on two 
consecutive 700 KD-pore sized ultrafiltration columns in order to elute out A-RBD. No 
decrease of infectivity could be detected for retroviruses carrying wild-type amphotropic 

15 envelopes after the two subsequent ultrafiltrations (Fig. 3), demonstrating that this process 
did not affect the viability of the retroviruses. However while before ultrafiltration the 
AdelH viral particles mixed with A-RBD could efficiendy infect XC target cells, as 
expected, only a residual infectivity could be detected after A-RBD elution (Fig. 3), 
indicating that stimulation of AdelH retroviruses was lost after separation from A-RBD. As 

20 control, the AdelH retroviruses processed through the two columns retained their full 
capacity to be stimulated by newly added A-RBD (Fig. 3). Similar experiments were 
conducted after separation of AdelH retroviruses from A-RBD by sephacryl 
chromatography and led to the same conclusions (data not shown). 

Altogether these data indicated that stimulation of infectivity of the AdelH retroviruses 
25 proceeded via activation of the target cell membrane, most likely as a consequence of 
interaction of A-RBD with the PiT-2 amphotropic receptor, rather than via interaction with 
the AdelH envelope incorporated on the retrovirus itself. Thus A-RBD could activate the 
target cell membrane which in turn became competent to allow entry of retroviruses 
carrying the fusion-defective AdelH envelope glycoproteins. 
30 The SU ammo-terminal end activates a cell entry pathway common to type C 
mammalian retroviruses. The histidine residue deleted in the H5del mutant glycoprotein 
belongs to a peptide motif, PHQ, found at the amino-termini of the SUs of all type C 
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mammalian retroviruses, suggesting a conserved function of this motif. Thus the H8del 
mutant ecotropic envelope glycoprotein (MOdelH), harboring the deletion of the histidine in 
the PHQV peptide motif of the MoMLV SU, is impaired for both cell-cell and virus-cell 
fusion (Bae et al., 1997). Similarly the H9del mutation of the GALV envelope glycoprotein 
5 GAdelH mutant results in impairement of both cell-cell and virus-cell fusion. Further 
experiments were performed to demonstrate that MOdelH, AdelH and GAdelH (delH) 
envelopes were phenotypically similar. Indeed the E-RBD fragment, containing the 
MoMLV ecotropic receptor binding domain, could rescue the infectivity of MOdelH 
retroviruses by up to 1,000 fold (Fig. 4A). Similar results were obtained for GAdelH 

10 retrovirus, whose fusogenicity could be efficiently rescued by soluble envelope fragment 
encompassing the GALV envelope glycoprotein receptor-binding domain. This result 
indicated that fusion activation by soluble envelope fragments was not particular to AdelH 
envelopes and suggested that the MLV SU amino-terminus may convey a signal common to 
other type C mammalian retrovirus which may be required to achieve early post-binding 

15 events. 

To address this hypothesis tike delH mutations, H5del, H8del, were introduced in the A- 
RBD or in the E-RBD fragments, respectively. The resultant A-RBDdelH and E-RBDdelH 
could bind their respective cell surface receptors as efficiently as the parental envelope 
fragments (Fig. 4B). They could also decrease the infectivity of retroviruses bearing wild- 

20 type amphotropic or ecotropic glycoproteins, respectively, by up to 10-fold (Fig. 4A). This 
effect was most likely a consequence of partial receptor blocking. However in contrast to A- 
RBD or to E-RBD, neither the A-RBDdelH nor the E-RBDdelH fragments could rescue the 
infectivity of AdelH or MOdelH retroviruses, respectively (Fig. 4A). These data therefore 
demonstrated that the integrity of the amino-terminal ends of either RBD fragments was 

25 absolutely required to activate postbinding functions and suggested that an essential 
determinant of cell membrane activation may reside at the amino-terminal end of the MLV 
RBD. Thus the so-called "receptor binding domain" located in the amino-terminal half of 
the MLV SU may in fact be composed of two different entities, one involved in specific 
receptor binding and a second, likely to be non functional in AdelH or MOdelH envelopes, 

30 involved in transmission of a signal that may activate cell entry upon receptor binding. 

Therefore to test whether the binding function and the activating function of the MLV 
receptor binding domain could be uncoupled, cells "infected" with AdelH or MOdelH 
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retroviruses were cross-incubated with either E-RBD or A-RBD polypeptides, respectively. 
Interestingly the infectivity of MOdelH retroviruses could be activated by either A-RBD or 
E-RBD fragments (Fig. 5A). This cross-activation was reciprocal since either fragments 
could activate, with similar efficacies, the infectivity of AdelH retroviruses (Fig. 5A). The 

5 cross-reactivity of either RBDs was dependent on the presence of both mCAT-1 and PiT-2 
receptors on the cell surface since neither PiT-2-transfected CHO cells nor mCAT-1- 
transfected CHO cells could be infected when E-RBD was used to activate the infectivity of 
AdelH retroviruses (Fig. 5B) and vice-versa (data not shown). The specificity of activation 
of AdelH/MOdelH retroviruses by ecotropic envelope fragments was further demonstrated 

10 by using the MOD84K-ST envelope fragment, harboring the D84K point mutation that 
inactivates MoMLV envelope binding (MacKrell et al., 1996). This fragment could activate 
neither AdelH nor MOdelH retroviruses despite the presence of both amphotropic and 
ecotropic receptors on target cells (Fig. 5A). Cross-activation of AdelH retroviruses could 
also be obtained by using soluble envelope glycoproteins from Gibbon Ape leukemia virus 

15 (data not shown), thus indicating that infection by delH retroviruses could be activated by a 
pathway common to type C mammalian retrovirus. 

Table 1. Activation of AdelH retroviruses by A-RBD 



20 Cells Pre-binding* binding and infection 6 Titre c 



XC - lacZ(AdelH) 5x10* 

XC - lacZ(AdelH) d + A-RBD d 3.1x10 s 

XC-A-RBD e - lacZ(AdelH) 9x10* 

25 XC A-RBD lacZ(Adeffl) 1.1x10* 

XC lacZ(AdelH) A-RBD 2.7x10 s 



a: Target cells were pre-incubated for one hr at 4°C with conditioned supernatants 
containing A-RBD (used undiluted) or with lacZ retroviral vectors carrying AdelH envelope 
30 glycoproteins, as indicated. 

b: Target cells were incubated for 3 hrs at 37°C. 

c: Titres as lacZ i.u./ml determined 48 hrs post-infection. 
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d: Both A-RBD (used undiluted) and AdelH retroviruses were added at the same time, 
e: XC cells constitutively expressing A-RBD polypeptides. 

Example 2: Control of syncytia formation by RBD polypeptides 

5 

Ideally, the transfer of therapeutic genes into tumour cells should lead both to direct 
local cell destruction and to activation of anti-tumour immunity to clear tumour deposits to 
which the genes cannot be delivered. The most commonly used genes for the control of 
local tumour growth have been the suicide gene/prodrug systems such as Herpes Simplex 

10 Virus thymidine kinase (HSVtk)/Ganciclovir (GCV) or cytosine deaminase (CD)/5FC 
systems. Importantly, these systems not only kill gene-transduced target cells but also close 
neighbours via local bystander effects. These effects depend upon cell-cell communication 
or on the diffusion of toxic metabolites, but generally act only over short ranges. 
Nonetheless, the existence of such bystander effects are crucial to compensate for the 

15 relatively poor efficiencies of gene transfer that are currently possible. However, results 
from early clinical trials have demonstrated that both better delivery systems and more 
effective genes are clearly still required to improve the efficiency of both local tumour 
control and of anti tumour immune responses. Hence, therapeutic genes with greater local 
killing activity would represent a significant advance for gene transfer strategies for cancer. 

20 It has been recently demonstrated that the well-known ability of several viral 

fosogenic membrane glycoproteins (FMG) to fuse target cells could be exploited 
therapeutically to kill tumour cells. These previous results introduced the FMG as a novel 
class of therapeutic genes which are more potent than existing suicide gene systems in 
killing tumour cells and do so by mechanisms which are highly immunostimulatory. Table 2 

25 shows the results of four FMGs derived from the non fusogenic ecotropic or amphotropic 
murine leukemia virus glycoproteins, whose fusogenicity were increased by deleting the 
p2R peptide (naturally removed during retrovirus egress) for the ARless and MoRless FMG 
and also by modifying the proline-riche region (Lavillette et al., 1998) for the C2Rless and 
PROMORless FMGs. 

30 The four resulting FMGs, Arless, C2Rless and PROMORless and MORless induced 
syncytia formation in most of the cell types that express amphotropic receptors and/or 
ecotropic receptors. However for selective eliminations of cells as required in cancer gene 
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10 



15 



therapy protocols, the design of a mechanism that can control the formation of syncyia is 
required. We therefore introduced in all four FMGs the H5del or the H8del mutation (delH 
mutation) which removes the 5th or the 8 th residue of the amphotropic or the ecotropic SU, 
respectively, an histidine belonging to the PHQV conserved motif. This mutation could 
inactivate an early event of the fusion process after receptor binding (Example 1 and 
Lavillette et al., 2000). As previously demonstrated the virus-cell fusion activity of delH 
MLV envelopes bearing the delH mutation can be efficiently rescued by using polypeptides 
encompassing the first 213 amino-acids of amphotropic MLV SU (A-RBD) or the first 236 
amino-acids of ecotropic MoMLV SU (E-RBD) (example 1 and Lavillette et al., 2000). 
Consistently with Example 1, the data in Table 2 demonstrated that the cell-cell fusion 
activity of the four FMGs engineered to harbor the delH mutation, AdelRless, 
C2delHRless, and PROMOdelHRless and MOdelHRless, could be rescued by providing 
either the A-RBD or E-RBD polypeptides thus leading to syncytia formation and elimination 
of target cells that express the relevant viral receptors. Interestingly formation of syncytia 
by delH FMGs could be achieved by using an heterologous polypeptide (E-RBD) that 
interacted with a receptor different of that recognised by the delH FMGs. 



Table 2. Modulation of cell-cell fusion by soluble envelope fragments 



25 



35 



Envelope RBD 


Syncytia 


delH envelope 


RBD 


Syncytia 


A 










ARless 


+ + 


AdelHRless 












A-RBD 


+/- 








E-RBD 


+ 


C2Rless 


+ + + 


C2delHRless 












A-RBD 


+/- 








E-RBD 


+ + 


PROMORless - 


+ +++ 


PROMOdelHRless 




+/- 








A-RBD 


+ + 








E-RBD 


+ + + + 


MORless 


+ + + + 


MOdelHRless 












A-RBD 


+ + + + 








E-RBD 


+ + + + 



fusion indexes (calculated according to Lavillette et al., 1998) -: less than 100, +/-: 100-304, 
+ : 300-1000, + + : 1000-10 000, + + + : 10 000-15 000, + + ++: >15000 
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Of note, similar results of fusion activation with RBD polypeptides were obtained 
with the GALVdelHRless envelope glycoprotein, indicating that the delH mutation of the 
potent GALVRless FMG (Bateman et al., 2000) can be controlled by adding a soluble Env- 
derived polypeptide (Lavillette and Cosset, unpublished results). These data indicate that it 
is possible to exploit a well-known phenomenon - the fusion of virally infected cells with 
uninfected cells as a novel, potent cytotoxic therapy using gene transfer. There are several 
options to exploit the use of delH FMGs in cell elimination stategies. Firstly viral vectors 
such as adenovirus vectors can be used to transduce both the delH FMG and the gene 
encoding the activating RBD polypeptide in cancer cells in vivo. Secondly cells transduced 
in vitro or ex vivo by a vector expressing a delH fusion-defective FMG can be induced to 
form syncytia after inoculation of an RBD presented as a protein preparation. In addition as 
well as using direct gene transfer, delH FMG and RBD can be effectively delivered through 
cell-mediated mechanisms, which will also be useful as an option for in vivo delivery of 
FMG into tumours for both local tumour eradication as well as enhancement of anti tumour 
vaccination strategies. 

Example 3: Addition of ligands on RBD polypeptides 

20 To further control the fusion activation of FMGs by RBD polypeptides, a ligand was 

inserted at the ammo-terminal end of the RBD. The anti-MHC-I 34.1 scFv was chosen as 
we previously demonstrated that it could redirect the host-range of retroviral vectors 
carrying 34. 1-ecotropic MLV chimeric envelope glycoproteins and allow specific infection 
of human cells (Marin et al. 1996). Therefore the 34. 1 scFV was fused to A-RBD or to E- 

25 RBD and preparation of 34.1 -A-RBD or 34.1-E-RBD polypeptides were obtained. 
Retroviral vectors carrying wild-type envelope glycoproteins, A, or AdelH fusion-defective 
amphotropic envelope glycoproteins were used to infect target cells in the presence of A- 
RBD, E-RBD, 34.1-E-RBD or 34.1-A-RBD polypeptides (Table 3). While either 
polypeptides could only weakly decrease the infectivity of parental retroviruses, they could 

30 strongly stimulate the infectivity of AdelH retrovirus. No significant differences could be 
detected when comparing the A-RBD, the E-RBD, the 34.1-E-RBD or the 34.1-A-RBD 
polypeptides in activation of AdelH retrovirus infectivity (Table 3). 
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Table 3. Modulation of virus-cell fusion by ligand-displaying 
envelope fragments 



5 Envelope RBD fragment Titers 



A - 8.1xl0 6 

A-RBD 5.6xl0 5 

E-RBD 8xl0 6 

10 34.1-A-RBD 1.35xl0 6 

34.1-E-RBD 3xl0 6 

AdelH - 2X10 1 

A-RBD 8.5xl0 4 

15 E-RBD 8x10 s 

34.1-A-RBD 7.3xl0 4 

34.1-E-RBD 7x10 s 



20 These data therefore suggest that it is possible to control the fusion activity of FMGs by 
adding specific ligands on the activating polypeptides. Therefore the design of a specific 
system to eliminate unwanted cells may consist in a delH FMG associated to an RBD 
polypeptide that displays a ligand that can specifically recognise a receptor expressed on the 
target cells. Table 4 shows that 34.1-A-RBD and 34.1-E-RBD could activate the cell-cell 

25 fusion activity of the PROMOdelHRless FMG with an efficiency similar to that of A-RBD 
or E-RBD. 

Altogether these results indicate that since the 34.1-E-RBD or 34.1-A-RBD 
polypeptides could cross-activate the fusogenicity of AdelH, AdelHRless, or 
30 PROMOdelHRless envelope glycoproteins on cells that express the targeted antigen (the 
MHC class I molecules) but no receptor corresponding to either the chimeric RBD or the 
delH retrovirus (Tables 3 & 4), efficient retargeted infection or cell-type specific syncytia 
formation could be obtained. 



35 
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Table 4. Modulation of cell-cell fusion by 
ligand-displaying envelope fragments 



Envelope 


RED 


Syncytia 


A 


- 


18 


ARless 


- 


4464 


ARlessdelH 




14 




RBD 


1014 


PROMORless 




14796 


PROMORlessdelH 




304 




A-RBD 


18920 




E-RBD 


17000 




34.1-A-RBD 


19000 




34.1-E-RBD 


20000 



Example 4: Activation of Env fusogenicity by RBD and C/TM domains 
20 interaction 

The H5del mutation was introduced into the CMO, the TMO and the CTMMO 
MLV amphotropic envelope glycoprotein (Lavillette et aL, in preparation), derived from the 
MLV 4070A envelope glycoprotein by substitution of the SU carboxy-terminal domain (C 

25 domain) and/or TM ectodomain with those derived from the MLV ecotropic envelope 
glycoprotein (Figure 29 A). Recent evidence from the literature suggest that the C domain of 
MLV envelope glycoproteins is involved in activation and/or control of the fusion 
machinery contained in the TM subunit of the envelope glycoprotein (Nussbaum et aL, 
1993; Pinter et aL, 1997). Retroviruses carrying either the parental or the delH-mutated 

30 CMOdelH, TMOdelH and CTMMOdelH chimeric envelope glycoproteins were prepared 
and their infectivity was assayed in the presence or the absence of A-RBD or E-RBD 
soluble polypeptides (Table 5). While the infectivity of AdelH control retroviruses could 
easily be rescued when either A-RBD or E-RBD polypeptides were present in the infection 
mixture (Lavillette et aL, 2000), the infectivity of CMOdelH, TMMOdelH and 

35 CTMMOdelH retroviruses could not be rescued with the A-RBD polypeptide although it 
was efficiently rescued with the E-RBD polypeptide (Table 5). These results indicated that 
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the fiisogenicity of the CMOdelH, TMMOdelH and CTMMOdelH envelope glycoproteins 
could only be activated by soluble RBD polypeptides derived from the same envelope 
glycoprotein from which the C and/or TM domain were inserted in the delH chimera. 
Further evidence for the requirement of a compatibility between the RBD and the C and/or 
TM domains were provided by the finding that the CTMMO retroviruses displayed a 
significantly slower kinetic of infection comparatively to that of retroviruses carrying wild- 
type MLV amphotropic envelope glycoproteins (Figure 29B). In the absence of exogenous 
RBD in the infection mixture, the replication retroviruses carrying the CTMMO envelope 
glycoproteins was delayed, most likely owing to the unability of the MLV 4070A-derived 
RBD of the latter to activate their MoMLV-derived C and/or TM domains. Yet the addition 
in the infection mixture of exogenous RBD polypeptides derived from the ecotropic 
MoMLV envelope glycoprotein (E-RBD), matching the C and/or TM domains of the 
CTMMO chimera could restore the infection kinetic of CTMMO retroviruses to a level 
similar if not identical to that of wild-type amphotropic retroviruses (Figure 29B). In 
contrast the kinetic of infection of retroviruses carrying wild-type amphotropic envelope 
glycoproteins was not increased in the presence of the E-RBD polypeptide. Additionally, 
infectivity with retroviruses carrying either wild-type amphotropic or CTMMO envelope 
glycoproteins was strongly decreased or inhibited, respectively, when infection was 
performed in the presence of A-RBD (Figure 29B). This inhibitory effect was most likely 
due to A-RBD-mediated partial blocking of amphotropic receptors in the case of wild-type 
amphotropic envelope glycoproteins and both receptor interference and incompatibility 
between this MLV amphotropic Env-derived fragment and the ecotropic C domain in the 
case of CTMMO chimera. 



Table 5 

Infectivity in the presence of : 
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noRBD 


A-RBD 


E-RBD 


10 6 


10 4 


10 6 


10 1 


10 5 


10 5 


10 6 


10 2 


10 6 


10° 


10° 


10 5 


10 6 


10 3 


10 6 


10° 


10 3 


10 5 


10 6 


10 1 


10 6 


10° 


10° 


10 5 


10 6 


10 6 


10* 



Example 5: Retargeted gene delivery using RBD fragments. 

We have therefore demonstrated that the RBD of a retroviral envelope glycoprotein 
5 is not only a domain responsible of attaching the viral particle to a cell surface receptor but 
is also a domain that is responsible, upon receptor binding, of fusion activation most likely 
via an interaction with the carboxy-terminal domain of the SU Env subunit (C domain). The 
RBD determinant responsible of this activation appears to be located at its ammo-terminal 
end and can be disrupted by the delH mutation. Thus receptor binding of RBD most 

10 probably promotes a conformational change of this amino-terminal end which, in turn, is 
rendered competent for inducing further conformational changes in the Env C domain that 
control the fusion machinery located in the TM Env subunit. We surmised that the 
replacement of the RBD of a retroviral envelope glycoprotein by a cell type specific ligand 
may redirect the attachment of viral particles carrying such targeting envelope 

15 glycoproteins, yet specific attachment of these virions would not promote fusion activation 
due to the absence in the novel ligand of a molecular device responsible of fusion activation. 
Such an activation, leading to infection, may therefore be restored by providing in trans a 
soluble RBD in the infection mixture. 

Data in Table 6 demonstrate that retroviral vectors carrying a chimeric envelope 

20 glycoprotein in which the wild-type RBD is replaced by a single-chain antibody against 
human class I MHC molecules cannot infect human cells although they can efficiently bind 
to class I MHC molecules expressed at the surface of the target cells. The addition in trans 
of RBD polypeptides during infection can promote the events necessary to allow infection of 
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the target cells, demonstrating that retargeted binding of chimeric viral particles can give 
rise to productive infection and/or gene delivery when the fusion domains of the targeting 
envelope are activated by an exogenous polypeptide. 

5 

Table 6. RBD-mediated activation of the fusion activity of retargeted envelope 
glycoproteins 



Enva 




+ RBDb 


+ RBD & competitor 0 


A 


106 


105 


105 


34.1-A 


<101 


105 


102 



a: LacZ retroviral vectors were generated with the indicated envelope glycoproteins. A, 
15 wild-type amphotropic Env; 34.1-A, a chimeric envelope glycoprotein derived from the 
MLV amphotropic Env in which the receptor binding domain was replaced by a single 
chain antibody (scFv) against human class I MHC molecule (Marin et al, 1995). The 
vectors were used to infect class I positive human cells. Titers as lacZ i.u./ml. 
b: A preparation of A-RBD polypeptide was added during infection. Similar results were 
20 obtained when the A-RBD polypeptide was added before or after infection with the 34.1-A 
retroviruses. 

c: Anti-MHC-I IgGs, corresponding to the Mab which was used to derive the anti-MHC-I 
scFv inserted into the 34.1-A chimera, were used as competitor in this infection assay. The 
anti-MHC-I IgGs were used to block the targeted antigens before infection with the 34.1-A 
25 retroviruses. 
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CLAIMS 

1. Use of at least one activating polypeptide comprising an amino acid 
sequence corresponding to : 

- the amino acid sequence of the surface subunit (SU) of an envelope 
glycoprotein of a first virus, or of a first retrovirus, or 

- an amino acid sequence derived from said surface subunit, or 

- a fragment of an amino acid sequence of said surface subunit, encompassing 
the receptor binding domain (RBD) and the amino terminal part of the surface 
subunit, 

for the rescue of the fusion property of a fusion defective envelope glycoprotein, 
which has the property of attachment to target cells and is inactive with respect to the 
fusion with said target cells, said fusion defective envelope glycoprotein belonging to 
a second virus or a second retrovirus of the same type as or of a different type from 
the above designated "first virus or first retrovirus". 

2. Use of an activating polypeptide according to claim 1, wherein the 
activating polypeptide recognizes substantially the same receptors of the target cells as 
the fusion defective envelope glycoprotein or wherein the activating polypeptide 
recognizes receptors of the target cells which are different from the ones recognized 
by the fusion defective envelope glycoprotein. 

3. Use according to any one of the claims 1 or 2, wherein the activating 
polypeptide and/or the fusion defective envelope glycoprotein is (are) linked to a 
ligand, which is specific with respect to some receptors of the target cells, the ligand 
being preferably linked to the activating polypeptide. 

4. Use according to claim 3, wherein the ligand is chosen among molecules 
liable to recognize receptors of target cells, for instance receptors of tumoral cells, 
such as EGF, FGF, VEGF. 



WO 00/71578 



41 



PCT/EP00/04534 



5. Use according to any one of claims 1 to 4, wherein the activating 
polypeptide is under the form of a product as such, or is expressed by a host, such as 
a virus or a cell, or is expressed, in particular by the target cells, transformed by 
nucleotide sequences coding for said activating polypeptide. 

6. Use according to any one of claims 1 to 5, wherein the fusion defective 
envelope glycoprotein is expressed by a host, such as a virus incorporating in its 
envelope said fusion defective envelope glycoprotein, or such as a cell, transformed 
by nucleotide sequences coding for said fusion defective envelope glycoprotein. 

7. Use according to any one of claims 1 to 6, wherein the activating 
polypeptide corresponds to a fragment of about 200 to about 250 amino acids of said 
surface subunit comprising the receptor binding domain (RED) and comprising the 
first amino acids of the amino terminal part of the surface subunit, the number of said 
first aminoacids varying from about 10 to about 15, in particular comprising the RBD 
and the PHQ peptide, in particular the fragments corresponding to amino acid 
sequences derived from type C mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, 
FELV-B, FELV-C 

in particular wherein the activating polypeptide corresponds to one of the 
following amino acid sequences : 
A-RBD 
E-RBD 
GALV-RBD 
34-1-A-RBD 
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34-1-E-RBD 

34-1-GALV-RBD 

EGF-A-RBD 

EGF-E-RBD 

EGF-GALV-RBD 

8. Use according to any one of claims 1 to 7, wherein the fusion defective 
envelope glycoprotein corresponds to the amino acid sequence of a mutated form of an 
envelope glycoprotein of a virus or a retrovirus which, in its non mutated form, 
presents attachment and fusion properties with respect to target cells, said fusion 
defective envelope glycoprotein being advantagously derived from type C mammalian 
retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 
. xenotropic 
. polytropic 
. 10A1MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, 
FELV-B, FELV-C and comprising : 

* one or several mutations in the amino terminal part of its surface subunit, 
particulary in the first amino acids of the amino terminal part of the surface subunit, 
the number of said first aminoacids varying from about 10 to about 15, said mutations 
inactivating the fusion property without substantially affecting the attachment 
property, in particular comprises one or several mutations within the PHQ amino- 
terminal sequence, wherein in particular the fusion defective envelope glycoprotein 
corresponds to one of the following amino acid sequences : 

- MO R less del H 

- MO del H 

- A R less del H 
-AdelH 
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- PROMO R less delH 

- PROMOdelH 
-GALVR less del H 

- GALVdelH 
-C2RlessdelH 

- C2delH 

- C MOdelH 

- TM MOdelH 
-CTMMOdelh 

* or the replacement of the amino-terminal domain of its surface subunit (RBD) 
by a ligand such as a growth factor, or a cytokine or a single chain antibody, the 
extent of said mutations inactivating the fusion property and the attachement property 
to the normal viral receptor, in particular wherein the fusion defective envelope 
glycoprotein corresponds to the following amino-acid sequence : 

-34.1-A 

* or the replacement of its SU carboxy-terminal domain and / or its TM 
ectodomain, and preferably its SU carboxy-terminal domain by the corresponding SU 
carboxy-terminal domain and / or the TM ectodomain of another "second" 
glycoprotein derived from type C mammalian retroviruses, with the proviso that said 
fusion defective glycoprotein and said second glycoprotein are not derived from the 
same type C mammalian retroviruses, in particular wherein the fusion defective 
envelope glycoprotein corresponds to one of the following amino-acid sequences : 

-CMO 

- TM MO 

- C TM MO 

9. Use of at least one activating polypeptide and of at least one fusion defective 
envelope glycoprotein for the preparation of a drug useful for the treatment of cancer 
and infectious diseases. 
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10. Products containing an activating polypeptide and a fusion defective 
envelope glycoprotein as defined in any one of claims 1 to 9, as a combined 
preparation for simultaneous, separate, or sequential use for the rescue of the fUsion 
defective property of said fusion defective envelope glycoprotein. 
5 11. Products containing an activating polypeptide and a fusion defective 

envelope glycoprotein as defined in any one of claims 1 to 9, as a combined 
preparation for simultaneous, separate or sequential use for the treatment of cancer 
pathologies. 

10 12. Activating polypeptides corresponding to a fragment of about 200 to about 

250 amino acids of said surface subunit comprising the RBD and the PHQ peptide, in 
particular the fragments corresponding to amino acid sequences derived from type C 
mammalian retroviruses such as: 

- Murine leukemia virus such as : 
15 . amphotropic 

. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

20 - or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, 

FELV-B, FELV-C 

in particular wherein the activating polypeptide corresponds to one of the 
following amino acid sequences : 
A-RBD 

25 E-RBD 

GALV-RBD 

34-1-A-RBD 

34-1-E-RBD 

34-1-GALV-RBD 
30 EGF-A-RBD 

EGF-E-RBD 

EGF-GALV-RBD 
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13. Fusion defective envelope glycoprotein to the amino sequence of an 
envelope glycoprotein of a virus or a retrovirus which in its native state presents 
attachment and fusion properties with respect to target cells, and comprises one or 
several mutations in the amino terminal part of its surface subunit, inactivating the 
fusion property without substantially affecting the attachment property, in particular 
comprises one or several mutations within the PHQ amino sequence of the amino 
terminal part of its surface subunit, wherein in particular the fusion defective envelope 
glycoprotein is derived from type C mammalian retroviruses such as: 

- Murine leukemia virus such as : 

. amphotropic 
. ecotropic 
. xenotropic 
. polytropic 
. 10 Al MLV 

- or GALV type retroviruses and related retrovirus such as SSAV, FELV-A, 
FELV-B or FELV-C 

in particular wherein the fusion defective envelope glycoprotein corresponds to 
one of the following amino acid sequences : 
-AdelH 
-PROMOdelH 

- GALVdelH 
-C2delH 
-ARlessdelH 

- PROMO R less del H 

- GALV R less del H 
-C2RlessdelH 
-CMO 

- TM MO 

- C TM MO 
-CMOdelH 

- TM MOdelH 



WO 00/71578 PCT/EP00/04534 

46 

- C TM MOdelh 

- 34.1-A 

with the proviso that they are different from the ecotropic MLV sequence, wherein the 
amino acids in position 2 to 8 are either substituted or deleted. 
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A-RBD Sequence 



10 20 30 40 50 60 

J123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 9Q1 234 567 890 
AK3 GOG CGT TCA ACG CTC TCA AAA CDC OCT CAA GOT AAG ATT AAC COG TQG AAG CCC TTA 
MetAlaArgSertoLeuSerLysProProGlnAspLysneAsnProTrp 

ASA GTC AUG GGA GTC CTG TTA GGA GTA GOG ATS OCA GAG AGC COC CAT CAG GTC TTT AAT 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro His Gin Val Phe Asn 



GTA AOC TQG AGA 
Val Thr Trp Arg 

GGA ACT GTA CAA 
Gly Thr Val Gin 

GAG TOG GAC OCT 
Glu Trp Asp Pro 

AGA CAG COG AOC 
Arg Gin Arg Thr 

TGT GGG GGA OCA 
Cys Gly Gly Pro 

TAG TQG AAG CCC 
Tyr Trp Lys Pro 



GAC ACQ GGA TOC OCT AAA 
Asp Thr Gly Cys Ser Lys 

TCC TIC CAA GGG OCT ACT 
Ser Phe Gin Gly Ala Thr 

GCA GGA AAA AAG OCT AAC 
Ala Gly Lys Lys Ala Asn 

GGA ACA GAT OCT ATT AOC 
Gly Thr Asp Pro lie Thr 

GTC CCC ATA GGG ATC CAA 
Val Pro lie Gly lie Gin 



TTT GTT AAA GAC AGG ATC 
Ike Val Lys Asp Arg lie 

CTA AAA CCA CTA GAA TAC 
Leu Lys Pro Leu Glu Tyr 



GTC ACC AAC CTG ATC ACT GGG OGT ACC GOC AAT GOC ADC TOC CTC CTG 
Val Thr Asn Leu Met Thr Qly Arg Thr Ala Asn Ala Thr Ser Leu Leu 

GAT GOC TTC OCA AAA TTA TAT TTT GAT CTA TGT GAT CTG CTC GGA GAG 
Asp Ala She Pro Lys Leu Tyr She Asp Leu qys Asp Leu Val Gly Glu 

TCA GAC CAG GAA COG TAT GTC GGG TAT GOC TOC AAG TAG CCC GCA GGG 
Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly 

OGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT AOC GTA AAG TOG GGG 
Arg Thr Efae Asp She T*r val Qys Pro Gly His ttn? Val Lys Ser Gly 

GGA GAG GOC TAC TGT GGT AAA TQG GGG TGT GAA AOC AOC GGA CAG OCT 
Qly Glu Gly Tyr Cys Gly lys Trp Gly qys Glu Thr Thr Gly Gin Ala 

ACA TCA TOG TOG GAC CTA ATC TOC CTT AAG OGC GGT AAC ACC CCC TOG 
Ser Trp Asp Leu lie Ser Leu lys Arg Gly Asn Thr Pro Trp 

CTT GOC TGT GGC CCC TGC TAC GAC CTC TOC AAA GTA TOC AAT 
Val Ala Cys Gly Pro cys Tyr Asp Leu Ser Lys Val Ser Asn 

CGA GGG GGC AGA TGC AAC OCT CTA GTC CEA GAA TTC ACT GAT 
Arg Gly Gly Arg Cys Asn Pro Leu val Leu Glu Phe Thr Asp 

TGG GAC GGG COC AAA TOG TOG GGA CTG AGA CTG TAC CGG ACA 
Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr 

A3K3 TTC TCC CTC AOC OQG CAG GTC CTT AAT GTG GGA CCC OGA 
Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg 

TACCTCTACGACC^OOCGAClMCXrAA^ 
TyrProTyrAspValProAspTyrAlaAsnArgLeuValGln 

TCA GTA GTC CAG GCT TTA GTC CTG ACT CAA CAA TAC CAC CAG 
Ser Val Val Gin Ala Leu Val Leu Thr Gin -Gin Tyr His Gin 

GAG OCA TGA 
Glu Pro ... 
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E-RBD- Sequence 



x ° 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 
ATG GOG CGT TCA ACG CTC TCA AAA OOC CTT AAA AAT AAG GTT AAC COG CGA GQC COC CTA 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC COC TTA ATT CTT CTG ATG CTC AGA GGG CTC AST ACT GOT TOG OCC OXC TCC ACT CCT 
lie Pro Leu lie Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Ser Ser Pro 



CAT CAA CTC 
His Gin Val 

ACT TCP GQC 
Thr Ser Gly 

TTA GOC CAC 
leu Ala His 

COG GGG OCC 
Pro Gly Pro 

OCT TTA ACC 
Fro Leu Thr 

ACA ACT CAT 
Bit Thr His 

ICC AAG TCA 
Ser Lys Ser 

OCT AGA GCT 
Gly Arg Ala 

ACC TCT GAC 
Thr Ser Asp 

GOG TTT ACA 
Arg Phe Thr 

OCT TTG TAT 
Arg Leu Tyr 

AAT CTA GGA 
Asn Leu Gly 

AAT OGA TTA 
Asn Arg Leu 

QUI CAA TAC 
Gin Gin Tyr 



TAT AAT ATC ACC TOG GAG GTA AOC AAT GGA GkT CGG GAG ACG GTA TGG GCA 
Tyr Asn lie Thr Trp Glu Val Our Asn Gly Asp Arg Glu Thr Val Trp Ala 

AAC CAC OCT CTG TGG ADC TQG TGG OCT GAC CTT ACC OCA GAT TTA TCT AUG 
Asn His Pro Leu Trp Ttir Trp Trp Pro Asp Leu Thr Pro Asp Leu Cys Met 

CAT. GGA OCA TCT TAT TQG QGG CTA GAA TAT CAA TOC OCT TTT TCT TCT OOC 
His Gly Pro Ser Tyr Trp Gly Leu Glu Tyr Gin Ser Pro She Ser Ser Pro 

OCT TOT TOC TCA GOG GQC AGC AGC OCA OOC TCT TOC AGA GAC TGC GAA GAA 
Pro Cys qys Ser Gly Gly Ser Ser Pro Gly cys Ser Arg Asp Qys Glu Glu 

TOC CTC AOC OCT OQG TGC AAC ACT GOC TQG AAC AGA CTC AAG CTA GAC CAG 
SerLeuThrProArgCysAsnThrAlaT^AsnArgL^ 

AAA TCA AAT GAG GGA TTT TAT GTT TOC COC GQG OOC CAC OGC COC OGA GAA 
Lys Ser Asn Glu Gly Phe Tyr Val Cys Pro Gly Pro His Ar* Pro Arg Glu 

TOP GQG GOT CCA GAC TOC TTC TAC TGT GCC TAT TQG GQC TCT GAG ACA ACC 
Cys Gly Gly Pro Asp Ser Phe Tyr Cys Ala Tyr Trp Gly cys Glu Thr Thr 

TAC TQG AAG OOC TOC TCA TCA TQG GAT TTC ATC ACA GTA AAC AAC AAT CTC 
Tyr Trp Lys Pro Ser Ser Ser Trp Asp Phe He Thr Val Asn Asn Asn Leu 

CAG GCT GTC CAG GTA TGC AAA GAT AAT AAG TGG TGC AAC COC TTA GTT ATT 
Gin Ala Val Gin Val cys Lys Asp Asn Lys Trp cys Asn Pro Leu Val lie 

GAC GOC GQG AGA OQG GTT ACT TOC TQG AOC ACA GGA CAT TAC TGG GGC TTA 
Asp Ala Gly Arg Arg Val Thr Ser Trp Thr Thr Gly His; Tyr Trp Gly Leu 

GTC TOC GGA CAA GAT OCA GGG CTT ACA TTT GGG ATC AGA CTC AGA TAC CAA 
Val Ser Gly Gin Asp Pro Gly Leu Thr Ihe Gly He Arg Leu Arg tyr Gin 

OOC OGC GTC CCA ATA GGG ATC CAA TAC OCC TAC GAC GTC OOC GAC TAT GOC 
Pro Arg Val Pro He Gly He Gin "tyr Pro Tyr Asp Val Pro Asp Tyr Ala 

GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG ACT 
Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu Thr 

CAC CAG CTA AAA OCA CTA GAA TAC GAG CCA TCA 
His Gin Leu Lys Pro Leu Glu Tyr Glu Pro . 
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GALV-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 S67 890 

ATG GTA TTG CTG OCT 006 TOC ATG CTT CTC AOC TCA AAC CTG CAC CAC CTT COG CAC CAG 60 
Met Val Leu Leu Pro Gly Ser Met Leu Leu Thr Ser Asa Leu His His Leu Arg His Gin 



ATG AST OCT OGG AGC TOG AAA ASA CTG ATC ATC 
Met Ser Pro Gly Ser Trp Lys Arg Leu lie He 

GOG AOG AST CTG CAA AAT AAG AAC COC CAC CAS 
Gly Thr Ser Leu Gin Asn Lys Asn Pro His Gin. 

TOC CAA ACT GGA GAC GTT GTC TOG GAT ACA AAG 
Ser Gin Thr Gly Asp Val Val Trp Asp Thr Lys 

CCC ACA CTT AAA OCT GAT GTA TGT GCC TTG GOG 
Pro Thr Leu Lys Pro Asp Val Cys Ala Leu Ala 

GGA AOC GAT GTC TOG TOC TCT AAA GGA GTC AGA 
Gly Thr Asp Val Ser Ser Ser Lys Arg Val Arg 

TAT AAG CAA ATC AOC TOG GGA GCC AXA GGG TOC 
Tyr Lys Gin He Thr Trp Gly Ala lie Gly cys 

GCA AOC TCT AOC TTC TAG GTA TGT COC OGG GAT 
Ala Ser Ser Thr Phe Tyr Val Cys Pro Arg Asp 

TOC GOG GOG CTA GAA TOC CIA TAC TGT AAA GAA 
Cys Gly Gly Leu Glu Ser Leu Tyr Cys Lys Glu 

TAT TGG CTA TCT AAA TOC TCA AAA GAC CTC ASA. 
Tyr Trp Leu Ser Lys Ser Ser Lys Asp Leu He 

TGG ACT CAA AAA TTT CAA CAG TGT CAC CAG AOC 
Trp Thr Gin Lys Itae Gin Gin Cys His Gin Thr 

TTC ACA GAC AAA GGA AAA TEA TOC AAG GAC TGG 
Phe Thr Asp Lys Gly Lys Leu Ser Lys Asp Trp 

TTC TAT GTG TCT GGA CAT OCA GGC GTA CAG TTC 
Phe Tyr Val Ser Gly His Pro Gly Val Gin Phe 

CCA GCT GTG GCA GTA GGG ATC CAA TAC CCC TAC 
Pro Ala Val Ala Val Gly lie Gin Tyr Pro Tyr 

GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC 
Val Gin Phe Val Lys Asp Arg He Ser Val Val 

CAC. CAG CTA AAA OCA CTA GAA TAC GAG CCA TGA 
His Gin Leu Lys Pro Leu Glu Tyr Glu Pro . . . 



CTC 
Leu 


TTA 
Leu 


AGC 
Ser 


TOC 
Cys 


GTA TTC GGC GGC GGC 
Val Phe Gly Gly Gly 


120 


GCC 
Pro 


ATG 
Met 


AOC 
Thr 


CTC 
Leu 


ACT TG&GAG GTA CTG 
Thr Trp Gin Val Leu 


180 


OCA 
Ala 


GTC 
Val 


CAG 
COn 


COC 
Pro 


OCT TGG ACT TGG TGG 
Pro Trp Thr Trp Trp 


240 


GCT 


AGT 
Ser 


CTT 
Leu 


GAG 
Glu 


TOC TGG GAT ATC OGG 
Ser Trp Asp He Pro 


300 


OCT 
Pro 


GOG 
Pro 


GAC 
Asp 


TCA 
Ser 


GAC TAT ACT GCC GCT 
Asp Tyr Thr Ala Ala 


360 


AGO 
Ser 


TAC 
Tyr 


OCT 
Pro 


OGG 
Arg 


GCT AGG ACT AGA ATG 
Ala Arg Thr Arg Met 


420 


GGC 
Gly 


GOG 
Arg 


ACC 
Thr 


CTT 
Leu 


TCA GAA GCT AGA AGG 
Ser Glu Ala Arg Arg 


480 


TOG 
Trp 


GAT 
Asp 


TGT 
Cys 


GAG 
Glu 


AOC AOG GGG AOC GGT 
Thr Thr Gly Thr Gly 


540 


ACT GTA 
Thr Val 


AAA 
lys 


TGG 

Trp 


GAC CAA AAT AGC GAA 
Asp Gin Asn Ser Glu 


600 


GGC 
Gly 


TGG 
Trp 


TGT 

Cys 


AAC 
Asn 


COC CTT AAA ATA GAT 
Pro Leu Lys He Asp 


660 


AXA 
He 


AOG 
Thr 


GGA 
Gly 


AAA 

Lys 


AOC TGG GGA TTA AGA 
Thr Trp Gly Leu Arg 


720 


AOC 
Thr 


ATT 
He 


GGC 
Arg 


TTA 
Leu 


AAA ATC AOC AAC ATG 
Lys lie Thr Asn Met 


780 


GAC 
Asp 


GTC 
Val 


COC 
Pro 


GAC 
Asp 


TAT GCC AAT GGA TTA 
Tyr Ala Asn Arg Leu 


840 


CAG 
Gin 


GCT 
Ala 


TTA 

Leu 


GTC 
Val 


CTG ACT CAA CAA TAC 
Leu Thr Gin Gin Tyr 


900 



933 
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EGF-A-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 34S 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ATG GOG CGT TCA AOG CTC TCA AAA COC CTT AAA AAT AAG GTT AAC COG CGA GGC CCC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC COC TTA AIT CTT CTG ATG CTC AGA GGS GTC AGT ACT GCT TOG COC GGC TOC AGT GOG 120 
lie Pro Leu lie Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Ser Ser Ala 

GCC CAG COG GOC ATG GCC AAT AGT ®C TCT GAA TGT COC CTG TOC CAC GAT GGG TAG TGC 180 
Ala Gin Pro Ala Met Ala Asn Ser Asp Ser Glu Cys Pro Leu Ser His A§p Gly Tyr Qys 

CTC CAT GAT GCT GTG TGC ATG TAT ATT GAA GCA TIG GAC AAG TAT OCA TGC AAC TGT GTT 240 
leu His Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp I#s Tyr Ala Cys Asn Cys Val 

GTT OGC TAG ATC GGG GAG CGA TGT CAG TAC OGA GAC CTG AAG TOG TOG GAA CTG OGC GOG 300 
Val Gly Tyr He Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Ala 

GCC GCA ATC GAG GGA AGG ATG GCA GAG AGC COC CAT CAG GTC TTT AAT GTA AOC TOG AGA 360 
Jtta Ala He Glu Gly Arg Met Ala Glu Ser Pro His Gin Val Phe Asn Val Tfar Trp Arg 

■. 

GIC AOC AAC CTG ATG ACT GGG OCT AOC GOC AAT GOC AOC TOC CTC CTG GGA ACT GTA CAA 420 
Val Thr Asn Leu Met Thr Gly Arg Tfar Ala Asn Ala Thr Ser Leu Leu Gly Thr Val Gin 

GAT GOC TTC GCA AAA TTA TAT TTT GAT CTA TGT GAT CTG CTC GGA GAG GAG TOG GAC OCT 480 
Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu Trp Asp Pro 

TCA GAC GAG GAA GOG TAT GTC GGG TAT GGC TGC* AAG TAC CCC GCA GGG AGA CAG OGG AOC 540 
Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys ^r Rdo Ala Gly Arg Gin Arg Tar 

GGG ACT TTT GAC TTT TAC GTG TGC OCT GGG CAT AOC GTA AAG TOG GGG TCT GGG GGA CCA 600 
Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys Gly Gly Pro 

GGA GAG GGC TAC TGT GGT AAA TOG GGG TGT GAA AOC AOC GGA CAG GCT TAC TOG AAG CCC 660 
Gly Gin Gly Tyr Cys Gly lys Trp Gly CysGOnThrlfcrGlyGlnAlaTyrTrpLysPro 

ACA TCA TOG TOO GAC CTA ATC TOC CTT AAG OGC GGT AAC AOC CCC TOG GAC AGG GGA TGC 720 
Thr Ser Ser Trp Asp 1^ ne Ser 1^ Lys Arg 

TCT AAA GTT GOC TGT GGC GOC TOC TAC GAC CTC TCC AAA GTA TOC AAT TOC TTC CAA GGG 780 
Ser Lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn Ser Phe Gin Gly 

GCT ACT OGA GGG GGC AGA TGC AAC OCT CTA GTC CTA GAA TTC ACT GAT GCA GGA AAA AAG 840 
Ala Thr Arg Gly Gly Arg cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala Gly Lys Lys 

GCT AAC TOG GAC GGG COC AAA TOG TOG GGA CTG AGA CTG TAC OGG ACA GGA ACA GAT OCT 900 
Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly Thr Asp Pro 

ATT ACC ATG TTC TCC CTG ACC OGG CAG GTC CTT AAT GTG GGA GOC OGA* GTC COC ATA GGG 960 
He Thr Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val Pro lie Gly 

ATC CAA TAC COC TAC GAC GTC COC GAC TAT GOC AAT OGA TTA GTT CAA TTT GTT AAA GAC 1020 
lie Gin Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Asn Arg Leu Val Gin Phe Val Lys Asp 

AGGATCTCAGTAGTCCAGGCTTTAGR: CTG ACT CAA CAA TAC CAC CAG CTA AAA OCA CTA 1080 
Arg lie Ser Val Val Gin Ala Leu Val Leu Thr Gin Gin Tyr His Gin Leu Lys Pro Leu 

GM TAC GAG CCA TGA 

Glu Tyr Glu Pro ... 1095 
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EGF-E-RBD- Sequence 



10 20 30 40 w 

123 456 789 012 345 678 90T 734 567 890 123 A 5 6 789 012 34S 678 901 234 ^7 «Qn 
AIG CCC CGT TCA ACG CTC TCA AAA CCC CTT AAA AAT AAG GTT AAC COG CGA GGC CCC CTA 
«e t Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Fro Ar£ E E 2 



60 
120 



WC 00C TTA ATT CTT CTG ATG CTC AGA 008 GTC ACT ACT OCT TOG CCC GOC'tCC ACT OOG 
Xle.Pro Leu lie Leu Leu Het Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Sot Ser Ala 

GCC CAG CCG GCC ATG GCC AAT ACT GAC TCT GAA TGT CCC CTG TOC CAC GAT 03G TAC TGC 180 
Ala Gin Pro Ala Met Ala Asn Ser Asp Ser Glu Cys Pro Leu Ser His AsJ? Gly Tyr Cys 

CTC CAT GAT QGT GTS TOC ATG TAT ATT GAA OCA TTG GAC AAG TAT GCA TGC AAC TGT CTT 240 
Leu His Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val 

GTT GGC TAC ATC GOS GAG CGA TGT CAG TAC CGA GAC CIG AAG TOG TGG GAA CTG GGC GOG 300 
ValGlyTyr IleGlyGluA^CysGlnTy,^^^^^^^^^^ 300 

Ala Ala ne Glu Gly Arg Ala Ser Pro Gly Ser Ser Pro His Gin Val Tyr Asn lie Thr 
Tip Glu Val Thr Asn Gly Asp Arg Glu Thr Val Trp Ala Thr Ser Gly Asn His Pro Leu 

SE5E=ssEEEE-ri;2 ==s - = 
e=e=eeeeeeee2--2-- = - - 



360 



420 



480 



EroPheSerSerProProGlyProProCysCysSer 
Gly Gly Ser Ser Pro Gly Cys Ser Arg Asp Cys Glu Glu Pro Leu Thr Ser Leu Thr Pro 

COS TOC AAC ACT GCC TGG AAC AGA CTC AAG CTA GAC CAG ACA ACT CAT AAA TCA AAT GAG 
Arg CyB Asn Thr Ala Trp Asn Arg Leu Lys Leu Asp Gin Thr Thr His Lys Sot Asn Glu 

GCa TTT TOT GTT TGC CCC GOG CCC CAC CGC CCC CGA. csja irr ~ 

<*^™<**o«y*oHl.SSSEEEES:£££ 

&m «^ "80 GGC TGT GAG ACA ACC OCT AGA OCT TAC TOG AAG CCC 

Asp Sot Phe Tyir Cys Ala Tyr Trp Gly Cys Glu Thr Thr Gly Arg Ala Tyr Trp Lys Pro 

Cm- ""^ TGG GAT TTC ATC ACA CTA AAC AAC AAT CTC ACC TCT GAC CAG GCT GTC CAG 
Ser Ser Ser Trp Asp Phe lie Thr Val Asn Asn Asn Leu Thr Ser Asp Gin Ala Val Gin 

V^^f^^^^^^^^^^^^^ACAGACGCCGGGAGA 
Val Cys Lys Asp Asn Lys Trp Cys Asn Pro Leu Val lie Arg Phe Thr Asp Ala Gly Arg 

A^^^^^^^^^^^^^^^^TGTCTCCGGACAA 
Arg Val Thr Ser Trp Thr Thr Gly His Tyr Trp Gly Leu Arg Leu Tyr Val Ser Gly Gin 

asp m> Gly Leu Thr Phe Gly lie Arg Leu Arg Tyr Gin Asn Leu Gly Pro Arg Val Pro 

lie Gly lie Gin Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Asn Arg leu Val Gin Phe Val 

AAA GAC AGS ATC TCA CTA GTC CAG OCT TTA GTC CTG ACT CAA CAA TAC CAC CAG CTA AAA 
Lys Asp Arg lie Ser Val Val Gin Ala Leu Val Leu Thr Gin Gin Tyr His Gin Leu Lys 
CCA CTA GAA TAC GAG CCA TGA 

Pro Leu Glu Tyr Glu Pro . . . 1161 
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EGF-GALV-RBD- . Sequence 



10 20 30 40 50 60 



123 


456 789 


012 345 678 901 234 567 890 




AAA 


AAT 
Asn 


AAG 
Lys 


GTT AAC COG OGA GGC OCC CTA 

Val A<5r» Prm Ayyt f2lvr t>m T nti 
"°* nu \3J.jf rCO LcU 


60 


GTC 

VOX 


AGT 
Ser 


ACT 

Thr 


GOT TOG OCC GGC TOC AGT GOG 
Aia ser .Pro Gly Ser Ser Ala 


120 


GAA 
Glu 


Cys 


POP 

Pro 


CTC TOC CAC GA$»,GQG TAC TOO 
Leu Ser His Asp Gly Tyr Cys 


180 


GCA 
Ala 


TTG 
Leu 


vxftJL. 
Asp 


AAG TAT GCA TGC AAC TCT GTT 
Lys Tyr Ala Cys Asa Cys Val 


240 


OGA 
Arg 


GAC 
Asp 


CTO 
Leu 


AAG TOG TOG GAA CTO CGC GOG 
Lys Trp Trp Glu Leu Arg Ala 


300 


AAG 
Lys 


AAC 
Asa 


COC 
Pro 


CAC CAG OCC ATO AOC CTC ACT 
His Gin Pro Met Thr Leu Thr 


360 


GTC 
Val 


TOG 
Trp 


GAT 
Asp 


ACA AAG GCA GTC CAG COC OCT 
Tar Lys Ala Val Gin Pro Pro 


420 


GTA 
Val 


TOT 

cys 


GOC 
Ala 


TTC GOG OCT AGT CTT GAG TOC 
Leu Ala Ala Ser Leu Glu Sex 


480 


TCT 
Ser 


AAA 
lys 


OGA 
Arg 


GTC AGA OCT COG GAC TCA GAC 
Val Arg Pro Pro Asp Ser Asp 


540 


OGA 
Gly 


Ala 


MfTTK 

A1A 

lie 


GGG TOC AGC TAC OCT OGG OCT 
Gly Cys Ser Tyr Pro Arg Ala 


600 


GTA 
Val 


TOT 

Cys 


COC 
Pro 


OQG GAT GOC OGG ACQ CTT TCA 
Arg Asp Gly Arg Thr Leu Ser 


660 


CTA 
Leu 


tac 
Tyr 


TOT 
Cys 


AAA GAA TOG GAT TOT GAG AOC 
lys Glu Trp Asp Cys Glu Tbr 


720 


TCA 

ser 


AAA 

Lys 


GAC 
Asp 


CTC ATA ACT GTA AAA TOG GAC 
Leu lie Thr Val Lys Trp Asp 


780 


GAG 
Gin 


TOT 
Cys 


CAC 
His 


CAG AOC GGC TOG TOP AAC COC 
Gin Thr Gly Trp Cys Asn Pro 


840 


TTA 
Leu 


TQC 

Ser 


AAG 
Lys 


GAC TOG ATA ACG GGA AAA AOC 
Asp Trp lie Thr Gly Lys Thr 


900 


OCA 
Pro 


GGC 
Gly 


GTA 
Val 


CAG TTC AOC ATT CGC TTA AAA 
Gin Phe Thr lie Arg Leu Lys 


960 



GGG 


ATC 


CAA 


TAC 


OCC TAC GAC GTC COC GAC TAT 


1020 


Gly 


He 


Gin 


Tyr 


Pro Tyr Asp Val Pro Asp Tyr 




GAC 


AGG 


ATC 


TCA 


GTA GTC CAG GCT TTA GTC CTC 


1080 


Asp 


Arg 


lie 


Ser 


Val Val Gin Ala Leu Val Leu 




CTA 


GAA 


TAC 


GAG 


OCA TCA 


1125 


Leu 


Glu 




Glu 


Pro ... 
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34.1-A-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ATG GOG OCT TCA AOG CTC TCA AAA CCC CTT AAA AAT AAG GTT AAC COG GGA GGC COC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Fro Leu Lys Asn Lys Val Asn Fro Arg Gly Fro Leu 

ATC CCC TEA ATT CTT CTC ATS CTC AGA GGG CTC ACT ACT OCT TOG CCC GGC TOC ACT GOG 120 
lie Fro Leu lie Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Fro Gly Ser Ser Ala 

GCC CAG COG GOC ATG GOC CAG GTT CAG CTT CAG CAG TCT GGA GCT GAA CTG CTA AAG OCT 180 
Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Lys Pro 

GOG GCT TCA GTG AAG TTC TOC TOC AAG GCT TCT GGC TAG AOC TTC ACA AGC TAT GAT ATA 240 
Gly Ala Ser Val Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Fhe Thr Ser Tyr Asp He 

AAC TOG GIG AGG CAG AGG OCT GAA CAG GGA CTT GAG TGG ATT GGA TOG ATT TTT OCT GGA 300 
Asn Trp Val Arg Gin Arg Fro Glu Gin Gly Leu Glu Tip lie Gly Trp'lle Fhe Pro Gly 

GAT GGT ACT ACT AAG TAC AAT GAG AAG TTC AAG GGC AAG GOC ACA CTG ACT ACA GAC AAA 360 
Asp Gly Ser Tfrr Lys Tyr Asn Glu Lys Fhe lys Gly Lys Ala Thr Leu Thr Thr Asp lys 

TOC TOC AGC ACA GOC TAC ATG CAG CTC AGC AGG CTG ACA TCT GAA GAC TCT GCT ATC TAT 420 
Ser Ser Ser Thr Ala Tyr Met Gin Leu Ser Arg Leu Thr Ser Glu Asp Ser Ala lie Tyr 

TTC TOT GCA AGA ATG GOC GAC TAT GGT CAC TAC GGC GCT GTG GAC TAC TGG GGT CAA GGG * 480 
Fhe Cys Ala Arg Met Ala Asp Tyr Gly His Tyr Gly Ala Val Asp Tyr Trp Gly Gin Gly 

AOC AGG GTC AOC GTC TOC TCA GGT GGA GGC GGT TCA GGC GGA GGT GGC TCT GGC GCT GGC 540 
Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 

GGA TOG GAC ATC CAG ATG ACT CAG TCT CCA TOC TOC CTG TCT GOC TCT CTG GGA GAC AGA 600 
Gly Ser Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Leu Gly Asp Arg 

GTC AOC ATC ACT TOC AGG GCA ACT CAG GAC ATT AGC AAT TAT TTA AAC TGG TAT CAG CAG 660 
Val Thr He Ser Cys Arg Ala Ser Gin Asp lie Ser Asn Tyr Leu Asn Trp Tyr Gin Gin 

ACA CCA GAT GGA ACT GTT AAA CTC CTG ATC TAC TAC ACA TCA AGA TTA CAC TCA GGA <3TC 720 
Thr Pro Asp Gly Thr Val Lys Leu Leu lie Tyr Tyr Thr Ser Arg Leu His Ser Gly Val 

CCA TCA AGG TIC ACT GGC ACT GGG TCT GGA ACA GAT TAT TCT CTC AOC ATT AGC AAC CTG 780 
Pro Ser Arg She Ser Gly Ser Gly Ser Gly Thr Asp Tyr Ser Leu Thr lie Ser Asn Leu 

GAG CAA GAA GAT ATT GOC ACT TAC TTT TGC CAA CAG GCT TAT AGG CTT COG TGG ACG TTC 840 
Glu Gin Glu Asp lie Ala Thr Tyr Fhe Cys Gin Gin Gly Tyr Thr Leu Pro Trp Thr Fhe 

GGT GGA GGC AGC AAG CTG GAG ATC AAA OGG GOG GOC GCA ATC GAG GGA AGG ATG GCA GAG 900 
Gly Gly Gly Thr Lys Leu Glu lie lys Arg Ala Ala Ala He Glu Gly Arg Met Ala Glu 

AGC COC CAT CAG GTC TTT AAT GTA ACC TGG AGA GTC AOC AAC CTG ATG ACT GGG OCT AOC 960 
Ser Pro His Gin Val Fhe Asn Val Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr 

GCC AAT GCC AOC TOC CTC CTG GGA ACT GTA CAA GAT GOC TTC CCA AAA TTA TAT TTT GAT 1020 
Ala Asn Ala Thr Ser Leu Leu Gly Thr Val Gin Asp Ala Fhe Pro Lys Leu Tyr Fhe Asp 

CTA TGT GAT CTG GTC GGA GAG GAG TGG GAC CCT TCA GAC CAG GAA COG TAT GTC GGG TAT 1080 
Leu Cys Asp Leu Val Gly Glu Glu Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr 
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34. 1 -A-RBD-. Sequence 



10 20 30 40 50 60 

123 456 7 8 9 012 345 678 901 23 4 567 890 123 456 789 012 345 678 901 234 567 890 



GGC TGC AAG TAC CCC GCA GOG AGA CAG 
Gly Cys Lys Tyr Pro Ala GQLy Arg Gin 


CGG 
Arg 


AOC 
Ihr 


CGG 
Arg 


ACT 
Thr 


TTT 
Phe 


GAC 
Asp 


TTT 
Phe 


TAC 


GIG 
Val 


TGC 
Cvs 


OCT 

Pro 


1140 


GOG CAT ADC GTA AAG TOG GGG TGT GOG 
Gly His Thr Val Lys Ser Gly Cys Gly 


GGA 
Glv 


OCA 
Pro 


GGA 
Glv 


GAG 
Glu 


GGC 

MOT 


TAC 
Tyr 


TGT 

cys 


GGT 
*»«Ly 


AAA 


TOG 

xrp 


GGG 

Civ 


1200 


TGT GAA ACC AOC QGA CAG GCT TAC TOG 
cys Giii Tux xojl (axy win aim. iyr ixp 


AAG 

Lys 


CCC 
Fro 


ACA 
Thr 


TCA 
Ser 


TOG 
Ser 


TOG 
Trp 


GAC 
Asp 


CTA 
Leu 


ATC 
lie 


TOC 
Ser 


CTT 
Leu 


1260 


AAG GGC GCT AAC ACC CCC TOG GbC AOG 
Lys Arg Gly Asn Thr Pro Trp Asp Thr 


GGA 
Gly 


TGC 

Cys 


TCT 
Ser 


AAA 
Lys 


CTT 
Val 


GOC 
Ala 


TGT 

cys 


GGC 
Gly 


CCC 
Pro 


TOC 
cys 


TAC 
Tyr 


1320 


GAC CTC TCC AAA GTA TOC AAT TOC TTC 
Asp Leu Ser Lys Val Ser Asn -Ser Phe 


CAA 
Gin 


GGG 
Gly 


GCT 
Ala 


ACT 
Thr 


OGA 
Arg 


GGG 
Gly 


GGC 
Gly 


AGA 
Arg 


TGC 
Cys 


AAC 
Asn 


OCT 
Pro 


1380 


CIA GTC CTA GAA TTC ACT GAT GCA GGA 
Leu Val Leu Glu Phe Thr Asp Ala Gly 


AAA 
lys 


AAG 

Lys 


GCT 
Ala 


AAC 
Asn 


TOG 
Trp 


GAC 
Asp 


GGG 
Gly 


CCC 
Pro 


AAA 
Lys 


TOG 
Ser 


TOG 
Trp 


1440 


GGA. CTG AGA CTG TAC CGG ACA GGA ACA 
Gly Leu Arg Leu iyr Arg Bur Gly Thr 


GAT 
Asp 


OCT 
Pro 


A!TT 
£Le 


ACC 
Thr 


AXG 
Met 


TTC "TOC 
Phe Ser 


CTG 
Leu 


AOC 
Thr 


OGG 
Arg 


CAG 
Gin 


1500 


GTC CTT AAT GIG GGA CCC OGA GTC CCC 
Val Leu Asn Val Gly Pro Arg Val Pro 


ATA 
He 


GGG 
Gly 


ATC 
He 


CAA 
Gin 


TAC 
Tyr 


CCC 
Pro 


TAC 

Tyr 


GAC 
Asp 


GTC 
Val 


CCC 
Pro 


GAC 
Asp 


1560 


TAT GOC AAT OGA TTA GTT CAA TTT GTT 
Tyr Ala Asn Arg Leu Val Gin She Val 


AAA 
Lys 


GAC 
Asp 


AGG 
Arg 


ATC 
He 


TCA 
Ser 


GTA 
Val 


GTC 
Val 


CAG 
Gin 


GCT TTA 
Ala Leu 


GTC 
Val 


1620 


CTG ACT CAA CAA TAC CAC CAG CTA AAA 
Lea Thr Gin Gin Tyr His Gin Leu lys 


CCA 

Pro 


CTA 
Leu 


GAA 
Glu 


TAC 
Tyr 


GAG 

Glu 


OCA 
Pro 


TGA 










1668 
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34.1 -E-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 3 45 678 901 234 567 890 123 456 789 012 345 678 901 234 567 B90 
ATS GOG OCT TCA ACG CTC TCA AAA, OCC CTT AAA AAT AAG GTT AAC OCXS OGA GQC OCC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC COC TTA ATT CTT CTC ATG CTC AGA GGG GTC AST ACT GCT TOG O0C G<fe TOC AGT GOG 120 
He Pro Leu lie Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Ser Ser Ala 

GCC CAS COG GCC ATG GCC CAG GTT CAG CTT CAS CAS TCT GGA GCT GAA CTG GTA AAG OCT 180 
Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Lys Pro 

GGG GCT TCA GIG AAG TTG TOC TOC AAG GCT TCT GQC TAC ACC TTC ACA AGC TAT GAT ATA 240 
Gly Ala Ser Val lys Leu Ser Cys lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr Asp lie 

AAC TGG GTG AGG CAG AGG OCT GAA CAS GGA CTP GAG TOG ATT GGA TSG ATT TTT OCT GGA 300 
Asa Trp Val Arg Gin Arg Pro Glu Gin Gly Leu Glu Tip lie Gly Trp lie Phe Pro (Sly 

GAT GGT AGT ACT AAG TAC AAT GAG AAG TTC AAG GQC AAG GCC ACA CTG ACT ACA GAC AAA 360 
Asp Gly Ser Thr Lys Tyr Asn Glu lys Phe lys Gly Lys Ala Thr Leu Thr Thr Asp Lys 

TOC TOC AGC ACA GCC TAC ATG CAG CTC AGC AGG CTG ACA TCT GAA GAC TCT GCT ATC TAT 420 
Ser Ser Ser Thr Ala Tyr Met Gin Leu Ser Arg Leu Thr Ser Glu Asp Ser Ala lie Tyr 

TTC TGT GCA AGA ATG GOC GAC TAT GCT CAC TAC GGC GCT GTG GAC TAC TGG GGT CAA GGG 480 
Phe Cys Ala Arg Met Ala Asp Tyr Gly His Tyr Gly Ala Val Asp Tyr Trp Gly Gin Gly 

ACC ACG GTC ACC GTC TOC TCA GGT GGA GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC 540 
Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 

GGJl TOG GAC ATC CAG ATG ACT CAG TCT CCA TOC TOC CTG TCT GCC TCT CTG GGA GAC AGA 600 
Gly Ser Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Leu Gly Asp Arg 

GTC ACC ATC AGT TSC AGG GCA AGT CAG GAC ATT AGC AAT TAT TTA AAC TGG TAT CAG CAG 660 
Val Thr lie Ser Cys Arg Ala Ser Gin Asp IleSerAsniyrLeuAsnTrpTyr Gin Gin 

AC* CCA GAT GGA ACT GTT AAA CTC CTG ATC TAC TAC ACA TCA AGA TTA CAC TCA GGA GTC 720 
ttir Pro Asp Gly Thr Val lys Leu Leu lie Tyr Tyr Thr Ser Arg Leu His Ser Gly Val 

CCA TCA AGG TTC AGT GGC AGT GGG TCT GGA ACA GAT TAT TCT CTC ACC ATT AGC AAC CTG 780 
Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Ser Leu Thr lie Ser Asn Leu 

GAG CAA GAA GAT ATT GCC ACT TAC TTT TOC CAA CAG GGT TAT ACG CTT COG TGG AGG TTC 840 
Glu Gin Glu Asp lie Ala Thr Tyr Ifce Cys Gin Gin Gly Tyr Thr Leu Pro Trp Thr Phe 

GGT GGA GGC ACC AAG CTG GAG ATC AAA CQG GOG GOC GCA ATC GAG GGA AGG GCT TOG CCC 900 
Gly Gly Gly Thr Lys Leu Glu lie Lys Arg Ala Ala Ala He Glu Gly Arg Ala Ser Pro 

GGC TOC AGT CCT CAT CAA GTC TAT AAT ATC ACC TGG GAG GTA AOC AAT GGA GAT GGG GAG 960 
Gly Ser Ser Pro His Gin Val Tyr Asn He Thr Trp Glu Val Thr Asn Gly Asp Arg Glu 

ACG GTA TGG GCA ACT TCT GQC AAC CAC CCT CTG TOG AOC TOG TGG CCT GAC CTT ACC CCA 1020 
Thr Val Trp Ala Thr Ser Gly Asn His Pro Leu Trp Thr Trp Trp Pro Asp Leu Thr Pro 

GAT TTA TGT ATG TTA GOC CAC CAT GGA CCA TCT TAT TOG GGG CTA GAA TAT CAA TOC OCT 1080 
Asp Leu Cys Met Leu Ala His His Gly Pro Ser Tyr Trp Gly Leu Glu Tyr Gin Ser Pro 
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34.1-E-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 S67 890 123 456 7flQ Q12 3 AS 678 QQ1 234 567 890 
ITT TCT TCT OOC COG GQG ODC OCT TOT TOC TCA GGG GGC AGC AX CCA GGC TOP TOC AGA 
Ihe Ser Ser Pro Pro Gly Pro Pro Cys Cys Ser Gly Gly Ser Ser Pro Gly Cys Ser Arg 

GfcC TOC GAA GAA OCT TTA AOC TOC CTC AGC OCT OGG TGC AAC ACT "QCC TC§5 AAC AGA CTC 
AspC^GluGluProLeulteSerl^atoEroArgC^Asal^Ala Trp* Asn Arg Leu 

AAG CTA GAC CAG ACA ACT CAT AAA TCA AAT GAG GGA TOP TAT GOT TOC OOC GGG CCC CAC 
Lys Leu Asp Gin TSir Ttor His lys Ser Asn Glu Gly Phe lyr Val Cys Pro Gly Pro His 

OBC CCC OGA GAA TOC AAG TCA TCT GGG GGT OCA GAC TOC OTC TAC TOP GOC TAT TOG GQC 
Arg Pro Arg Glu Ser Lys Ser Cys Gly Gly Pro Asp Ser Phe Tyr Cys Ala Tyr Trp Gly 

TOT GAG ACA ACC GGT AGA GCT TAC TOG AAG OOC TOC TCA TCA TOG GAT OTC ATC ACA CTA 
Os Glu Kir Oto Gay Arg Ala Tyr Trp Lys Pro Ser s« 

AiC AAC AAT CTC ACC TCT GAC CAG GCT GTC CAG CTA TOC AAA GAT AAT AAG TOG TOC AAC 
Asn Asa Asn Leu ttr Ser Asp Gin Ala Val Gin Val qys lys Asp Asn *ys Trp cys Asn 

.OOC TTA, GTT .ATT OQG .TTT ACA GAC GOC GGG AGA OGG GOT ACT TOC TOG AOC ACA GGA CAT 
Pro Leu Val He Arg Phelnr Asp Ala Gly Arg Arg Val Tnr Ser Trp Tar Thx Gly His 

TIC TOG GGC TTA OCT TIG TAT GTC TOC GGA CAA GAT CCA GOG CTT ACA TTT GQG ATC AGA 
**r Trp Gly toi Arg Leu Tyr Val Ser Gly Oln Asp Pro Gly Leu Tnr Bfae Gly lie Arg 

C1C AGA TAC CAA AAT CTA GGA OOC GQC GTC OCA ASA GGG ATC CAA TAC OOC TAC GAC QIC 
Ltfii Arg Tyr Gin Asn Leu Gly Pro Arg Val Pro lie Gly He Gin Tyr Pro Tyr Asp Val 

CCC GAC TAT GOCvAAT OGA TTA GTT O^A TTT GTT AAA GAC AGG ATC TCA GPA GTC CAG GCT 
Pato Asp Tyr Ala Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala 

TTK GTC CTO ACT CAA CAA TAC CAC CAG CTA AAA OCA CTA GAA TAC GAG CCA TCA 
Lea Val Leu *hr Gin Gin Tyr His Gin Leu Lys Pro Leu Glu l*r Glu Pro ... 



1140 



1200 



1260 



1320 



1380 



1500 
1560 
1620 
1680 
1734 
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34.1-GALV-RBD- Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ATG GOG CGT TCA AGG CTC TCA AAA OCC CTT AAA AAT AAG GTT AAC OCXS OGA GQC COC CTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asn Pro Arg Gly Pro Leu 

ATC GCC TTA ATT CTT CTC ATG CTC AGA 006 GTC ACT ACT OCT TOG COC GQ&.TCC ACT GOG 120 
lie Pro Leu He Leu Leu Met Leu Arg Gly Val Ser Thr Ala Ser Pro Gly Ser Ser Ala 

GCC CAG CCG GCC ATG GCC CAG GTT CAG CTT GAG GAG TCT GGA GOT GAA CTG CTA AAG OCT 180 
Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Lys Pro 

GOG GCT TCA GTG AAG TTC TCC TGC AAG GCT TCT GGC TAC AOC TTC ACA AGC TAT GAT ATA 240 
Gly Ala Ser Val Lys Leu Ser cys lys Ala Ser Gly Tyr Thr She Thr Ser Tyr Asp lie 

AAC TOG GTG AGG CAG AGG OCT GAA CAG GGA CTT GAG TOG ATT GGA TGG ATT TIT OCT GGA 300 
Asn Trp Val Arg Gin Arg Pro Glu Gin Gly Leu Glu Trp lie Gly Trp He Phe Pro Gly 

GAT GOT ACT ACT AAG TAC AAT GAG AAG TTC AAG GQC AAG GCC ACA CTG ACT ACA GAC AAA 360 
Asp Gly Ser Tfar Lys Tyr Asn Glu lys Ifce lys Gly lys Ala Thr Leu Thr Thr Asp Lys 

TCC TOC AGC ACA GCC TAC ATG CAG CTC AGC AGG CTG ACA TCT GAA GAC TCT GCT ATC TAT 420 
Ser Ser Ser Thr Ala Tyr Met Gin Leu Ser Arg Leu Thr Ser Glu Asp Ser Ala He Tyr 

TTC TGT OCA AGA ATG GCC GAC TAT GOT GAC TAC GQC GCT GIG GAC TAC TGG GCT CAA GGG 480 
Phe Cys Ala Arg Mat Ala Asp Tyr Gly His Tyr Gly Ala Val Asp Tyr Trp Gly Gin Gly 

AGC AGG GTC AGC GTC TOC TCA GOT GGA GQC GCT TCA GQC GGA GCT GQC TCT GQC GCT GQC 540 
Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 

GGA TOG GAC ATC CAG ATG ACT CAG TCT OCA TCC TOC CTG TCT GCC TCT CTG GGA- GAC AGA 600 
Gly Ser. Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Leu Gly Asp Arg 

GTC ACC ATC ACT TGC AGG OCA ACT CAG GAC ATT AGC AAT TAT TTA AAC TGG TAT CAG CAG 660 
Val Thr lie Ser Cys Arg Ala Ser Gin Asp He Ser Asn Tyr Leu Asn Trp Tyr Gin Gin 

ACA OCA GAT GGA ACT GTT AAA CTC CTG ATC TAC TAC ACA TCA AGA TTA CAC TCA GGA GTC 720' 
Thr Pro Asp Gly Thr Val lys Leu Leu He Tyr Tyr Thr Ser Arg Leu His Ser Gly Val 

CCA TCA AGG TTC ACT GQC ACT GGG TCT GGA ACA GAT TAT TCT CTC AOC ATT AGC AAC CTG 780 
Pro Ser Arg She Ser Gly Ser Gly Ser Gly Thr Asp Tyr Ser Leu Thr He Ser Asn Leu 

GAG CAA GAA GAT ATT GOC ACT TAC TTT TGC CAA CAG GCT TAT AGG CTT COG TOG AGG TTC 840 
Glu Gin Glu Asp He Ala Thr Tyr Ihe Cys Gin Gin Gly Tyr Thr Leu Pro Trp Thr Phe 

GGT GGA GGC AOC AAG CTG GAG ATC AAA OGG GOG GCC GGA ATC GAG GGA AGG ACT CTG CAA 900 
Gly Gly Gly Thr lys Leu Glu He Lys Arg Ala Ala Ala He Glu Gly Arg Ser Leu Gin 

AAT AAG AAC COC CAC CAG COC ATG ACC CTC ACT TOG CAG GTA CTG TCC CAA ACT GGA GAC 960 
Asn lys Asn Pro His Gin Pro Met Thr Leu Thr Trp Gin Val Leu Ser Gin Thr Gly Asp 

GTT GTC TGG GAT ACA AAG GCA GTC CAG COC GCT TGG ACT TOG TGG COC ACA CTT AAA CCT 1020 
, Val Val Trp Asp Thr Lys Ala Val Gin Pro Pro Trp Thr Trp Trp Pro Thr Leu Lys Pro 

GAT GTA TCT GCC TTC GOG GCT ACT CTT GAG TCC TGG GAT ATC COG GGA ACC GAT GTC TOG 1080 
Asp Val Cys Ala Leu Ala Ala Ser Leu Glu Ser Trp Asp He Pro Gly Thr Asp Val Ser 
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34.1 -GALV-RBD-. Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 



ICC 
Ser 


TCT 
Ser 


AAA 

Lys 


GGA 
Arg 


GTC 
Val 


AGA OCT 
Arg Pro 


fSAf^ 

l3niU £Y«ft. 

Pro Asp Ser 


UtfVL 

Asp 


Tyr 


%nn* nrv* f5nn» i**»n Pis ump inn 
AJUl v^JU vA.X uii. nnu win ivl\* iUJL> 

Thr Ala Ala Tyr Lys Gin lie Thr 




TOG 
Tip 


GGA 
Gly 


GCC 
Ala 


ATA 
lie 


GGG 
Gly 


TGC AGC 
Cys Ser 


TAC OCT CGG 
Tyr Pro Arg 


GCT 
Ala 


AGG 
Arg 


ACT AGA ATG GCA AGC TCT AOC TIC 
Thr Arg Met Ala Ser Ser Thr Phe 


1200 


TAG 
Tyr 


GTA 
Val 


TGT 

cys 


CCC 
Pro 


CGG 
Arg 


GAT GGC 
Asp Gly 


COG ACC CTT 
Arg Thr Leu 


TCA 
Ser 


GAA 
Glu 


GCT AGA AGG TGC GGG GGG CTA GAA 
Ala Arg Arg Cys Gly Gly Leu Glu 


1260 


TOC 
Ser 


CTA 
Leu 


TAC 

Tyr 


TGT 
Cys 


AAA 
Lys 


GAA TOG 
Glu Trp 


GAT TGT GAG 
Asp cys Glu 


AOC 
Thr 


AOG 
Thr 


GGG AOC GGT TAT TOG CTA TCT AAA 
Gly Thr Gly Tyr Trp Leu Ser Lys 


1320 


TOC 
Ser 


TCA 
Ser 


AAA 

Lys 


GAC 
Asp 


CTC 
Leu 


ATA ACT 
lie Thr 


CTA AAA TOG 
Val Lys Trp 


GAC 
Asp 


CAA 

Gin 


AAT AGC GAA TOG ACT CAA AAA TTT 
Asn Ser Glu Trp Tftr Gin Lys Phe 


1380 


CAA 
Gin 


CAS 
Gill 


TCT 
Cys 


CAC 
His- 


CAG 
Gin 


AOC GQC 
Thr Gly 


TOG TGT AAC 

Trp cys Asn 


OOC 
Pro 


err 
Leu 


AAA ATA GAT TIC ACA GAC AAA GGA 
Lys lie Asp Phe Thr Asp Lys Gly 


1440 


AAA 


TTA 
Leu 


TOC 
Ser 


AAG 
Lys 


GAC 
Asp 


TOG ATA 
Trp He 


AOS GGA AAA 
Thr Gly Lys 


AOC 
Thr 


TOG 
Trp 


GGA TTA AGA TOC TAT GIG TCT GGA 
Gly Leu Arg Phe Tyr Val Ser Gly 


1500 


car 

His 


CCA GGC 
Pro Gly 


GTA 
Val 


CAG 
Gin 


TOC AOC 
Che Thr 


ATT OGC .TTA 
He Arg Leu 


AAA 

Lys 


ATC 
lie 


AOC AAC AXG OCA GCT GTS GCA GTA 
Thr Asn Met Pro Ala Val Ala Val 


1560 



GGG 


ATC CAA 


TAC 


CCC 


TAC GAC 


GTC CCC GAC 


TAT 


GCC 


AAT OGA TTA GIT CAA TTT GTT AAA 


1620 


Gly 


He Gin. 


Tyr 


Pro 


Tyr Asp 


Val Pro Asp 


Tyr 


Ala 


Asn Arg Leu Val Gin Phe Val Lys 




GAC 


AGG ATC 


TCA 


GTA 


GTC CAG 


GCT TTA GTC 


CTO 


ACT 


CAA CAA TAC CAC CAG CTA AAA OCA 


1680 


Asp 


Arg He 


Ser 


Val 


Val Gin 


Ala Leu Val 


Leu 


Thr 


Gin Gin Tyr His Gin Leu Lys Pro 




CTA 


GAA TAC 


GAG 


OCA 


TGA 










1698 


Leu 


Glu Tyr 


Glu 


Pro 
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AdelH Sequence 



10 20 30 40 50 60 

12V 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ATG GCG CGT TCA AGG CTC TCA AAA CCC CCT CAA GAT AAG AIT AAC 006 TOG AAG CCC TEA 60 
Met Ala Arg Ser Thr Leu Ser Lys Fro Pro Gin Asp Lys lie Asn Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTG TEA GGA GTA GGG ATG OCA GAG AGC CCC CAG GTC TTT AAT GTA 120 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACC TOG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GOC ACC TCC CTC CTG GGA 180 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr Set Leu Leu Gly 

ACT GTA CAA GAT GOC TTC OCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 240 
Thr Val Gin. Asp Ala Phe Pro Lys Leu Tyr Ehe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC OCT TCA GAC CAG GAA COG TAT GTC GGG TAT GOC TGC AAG TAC GCC GCA GGG AGA 300 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly cys Lys Tyr Pro Ala Gly Arg 

CAG OGG ACC OGG ACT TTT GAC TTT TAC GTG TGC OCT GGG CAT AOC GTA AAG TOG GGG TGT 360 
Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys 

GGG GGA OCA GGA GAG GOC TAC TGT GGT AAA TGS GGG TGT GAA AOC AOC GGA CAG GOT TAC 420 
Gly Gly Pro Gly Glu Gly Tyr Cys Gly I^s Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TOG AAG GOC ACA TCA TOG TGG GAC CTA ATC TCC CTT AAG OGC GGT AAC AOC OOC TOG GAC 480 
Trp Lys Pro Thr Ser Ser Trp Asp Leu He Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACG GGA TGC TCT AAA GTT GOC TGT GOC OOC TGC TAC GAC CTC TCC AAA OTA TCC AAT TCC 540 
Thr Gly Cys Ser lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn Ser 

TTC CAA GGG GCT ACT OGA GGG GGC AGA TGC AAC OCT CTA GTC CTA GAA TTC ACT GAT GCA 600 
Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGA AAA AAG GCT AAC TGG GAC GGG OOC AAA' TOG TGG GGA CTG AGA CTG TAC OGG ACA GGA 660 
Gly lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT OCT ATT AOC ATG TTC TCC CTG ACC OGG CAG GTC CTT AAT GTG GGA OOC OGA GTC 720 
Thr Asp Pro lie Thr Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val 

CCC ATA GGG COC AAC OCA GTA TTA OOC GAC CAA AGA CTC CCT TOC TCA CCA ATA GAG ATT 780 
Pro lie Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro He Glu He 

GTA COG GCT OCA CAG CCA OCT AGC COC CTC AAT ACC AGT TAC OOC OCT TOC ACT AOC AGT 840 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr Thr Ser 

w 

ACA OOC TCA AOC TOC OCT ACA AGT CCA AGT GTC OCA CAG OCA OOC OCA GGA ACT GGA GAT 900 
Thr Pro Ser Thr Ser Pro Thr Ser Pro Ser Val Pro Gin Pro Pro Pro Gly Thr Gly Asp 

AGA CTA CTA GCT CTA GTC AAA GGA GOC TAT CAG GCG CTT AAC CTC AOC XAT COC GAC AAG 960 
Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr Gin Ala Leu Asn Leu Thr Asn Pro Asp Lys 

ACC CAA GAA TGT TGG CTG TGC TTA GTG TOG GGA OCT OCT TAT TAC GAA GGA GTA GCG GTC 1020 
Thr Gin Glu Cys Trp Leu Cys Leu Val Ser Gly Pro Pro Tyr Tyr Glu Gly Val Ala Val 

GTG GGC, ACT TAT ACC AAT CAT TCC ACC GCT COG GCC AAC TGT AOG GCC ACT TOC CAA CAT 1080 
Val Gly Thr Tyr Thr Asn His Ser Thr Ala *Pro Ala Asn Cys Thr Ala Thr Ser Gin His 

AAG CTT AOC CTA TCT GAA GTG ACA' GGA CAG GGC CTA TGC ATG GGG GCA GTA OCT AAA ACT 1140 
Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu Cys Met Gly Ala Val Pro Lys Thr 

CAC CAG GOC TTA TGT AAC AOC AOC CAA AGC GOC GGC TCA GGA TOC TAC TAC CTT GCA GCA 1200 
His Gin Ala Leu Cys Asn Thr Thr Gin Ser Ala Gly Ser Gly Ser Tyr Tyr Leu Ala Ala 
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AdelH Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

CCC GCC GGA ACA ATG TGG GCT TGC AGC ACT GGA TTGACTCCCTOCTTSTCCACCACGGTC 1260 
Pro Ala Gly Thr Met Trp Ala Cys Ser Thr Gly Leu Thr Pro Cys Leu Ser Tftr Thr Val 



crc 
Leu 


AAT 
Asia 


CTA ADC ACA 
Leu Thr thr 


GAT TAT TGT 
Asp Tyr Cys 


GCA 
Val 


TTA 
Leu 


GTT 
Val 


GAA 
Glu 


CTC TGG 
Leu Trp 


CCC AGA g£a ATT TAC CAC 
Pro Arg Val lie Tyr His 


1320 


TOC 
Ser 


CCC 
Pro 


GAT TAT ATG 
Asp Tyr Met 


TAT GGT CAG 
Tyr Gly Gin 


CTT 
Leu 


GAA 
Glu 


CAG 

Gin 


GGT 
Arg 


ADC AAA 

Thr Lys 


ITOklD HHI^ ^^^*J^ ^ttlk 

LiVX AAA. ASjA GAB ULA vjIA 

Tyr Lys Arg Glu Pro Val 


J_3oU 


TCA 
Ser 


TTG 
Leu 


ACC CTG GOC 
Thr Leu Ala 


CTT CTA CTA 
Leu Leu Leu 


GGA 

Gly 


GGA 
Gly 


TTA 
Leu 


ACC 
thr 


ATS GGA 
Met Gly 


GGG AIT GCA GCT GGA ATA 
Gly He Ala Ala Gly He 


1440 


GGG 
Gly 


AOG 
Thr 


GGG ACC ' ACT 
Gly Tfer Thr 


GOC TEA ATT 
Ala Leu lie 


AAA 

Lys 


ACC 
Thr 


CAG 
Gin 


CAG 
Gin 


TIT GAG 
?be Glu 


CAG CTT CAT GCC GCT ATC 
Gin Leu His Ala Ala He 


1500 


CAG 

Gin 


ACA 
Thr 


GAC CTC AAC 
Asp Leu Asa 


GAA GTC GAA 
Glu Val Glu 


AAG 
Lys 


TCA 
Ser 


ATT 
He 


ACC 
Thr 


AAC CTA 
Asn Leu 


GAA AAG TCA CTG ACC TOG 
Glu Lys Ser Leu Thr Ser 


1560 


TTG 
Leu 


TCT 
Ser. 


GAA GTA GTC 
Glu Val Val 


CTA CAG AAC 
Leu Gin Asn 


OGC 
Arg 


AGA 
Arg 


GOC 

Gly 


CTA 
Leu 


GAT TTG 
Asp Leu 


CTA TTC CTA AAG GAG GGA 
Leu Fhe Leu Lys Glu Gly 


1620 


GST 
Gly 


CDC 
Leu 


TGC GCA GOC 
Cys Ala Ala 


CCA AAA GAA 
Leu Lys Glu 


GAA 
Glu 


TGT 

cys 


TGT 

cys 


TTT 
Fhe 


TAT GCA 
Tyr Ala 


GAC CAC AOG GGG CTA GTG 
Asp His Thr Gly Leu Val 


1680 


AGA 
•Arg 


GAC 
Asp 


AGC ATG GOC 
Ser Met .Ala 


AAA TIA AGA 
Lys Leu Arg 


GAA AGG 
Glu "Arg 


CTT 
Leu 


AAT 
Asn 


CAG AGA 
Gin Arg 


CAA AAA CTA TTT GAG ACA 
Gin Lys Leu Fhe Glu Thr 


1740 



GOC CAA GGA TGG TTC GAA GGG CTG TTT AAT AGA TOC CCC TGG TTT ACC ACC TTA ATC TOC 1800 
Gly Gin Gly Trp Eke Glu Gly Leu ttie Asn Arg Ser Pro Trp Fhe Thr Thr Leu He Ser 



ACC ATC ATG GGA OCT CTA ATA GTA CTC TTA CTG ATC TTA CTC TTT GGA OCT TGC ATT CTC 1860 
Thr He Met Gly Pro Leu He Val Leu Leu Leu He Leu Leu Fhe Gly Pro Cys He Leu 

AAT CGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG ACT 1920 
Asn Arg Leu Val Gin Che Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu Thr 

CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA 1959 
Gin Gin Tyr His Gin Leu lys Pro Leu Glu Tyr Glu Pro 
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PROMOdelH Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 
ATG GOG OCT TCA ACG CTC TCA AAA COC OCT CAA GAT AAG ATT AAC 006 TGG AAG CCC TEA 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys lie Asn Pro Tip Lys Pro Leu 



60 



ATA GTC ATG GGA GTC CTG TEA GGA GTA GGG AUG GCA GAG AGO COC CAG GTC TTT AAT GTA 
He Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 



120 



ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GOC AAT GCC AOC TOC CTC CTG GGA 180 
Thr Trp Arg Val Thr Asn Leu Met Uir Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GOC TIC OCA AAA TEA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 240 
Tbi Val Gin Asp Ala Phe Pro Lys Leu Tyr Hie Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC OCT TCA GAC CAG GAA COG TOT GTC GGG TAT GOC TGC AAG TAC CCC GCA GGG AGA 300 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG COG ACC CGG ACT TTT GAC TTT TAC CTG TGC OCT GGG CAT ACC GTA AAG TOG GGG TGT 360 
Glu Arg Thr Arg Tnr Phe Asp Hie Tyr Val cys Pro Gly His Thr Val Lys Ser Gly Cys 

GGG GGA CCA GGA GAG GOC TAC TGT GGT AAA. TOG GGG TGT GAA ACC AOC GGA CAG GCT TAC 420 
Gly Gly Pro Gly Glu Gly Tyr CJys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TGG AAG GOC ACA TCA TOG TGG GAC CTA ATC TOC CTT AAG CGC GGT AAC ACC COC TGG GAC 480 
Trjp Lys Pro Thr Ser Ser Trp Asp Leu lie Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACG GGA TGC TCP AAA GTT GCC TGT GOC COC TGC TAC GAC CTC TOC AAA GTA TOC AAT TOC 540 
Thr Gly cys Ser lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn Ser 

TTC CAA GGG GCT ACT GGA GGG GGC AGA TGC AAC OCT CTA GTC CTA GAA TTC ACT GAT GCA 600 
Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TGG TGG GGA CTG AGA CTG TAC CGG ACA GGA 660 
Gly Lys Lys Ala Asn Trp Asp Gly Pro lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT CCP ATT AOC ATG TTC TOC CTG AOC COG CAG GTC CTT AAT GTG GGA OOC OGA GTC 720 
Thr Asp Pro He Thr Met Hie Ser Leu Tnr Arg Gin Val Leu. Asn Val Gly Pro Arg Val 

OCC AXA GGG CCC AAC COC GTT CTG GCA GAC CAA CAG CCA CTC TOC AAG CCC AAA CCT GTT 780 
Pro lie Gly Pro Ash Pro Val Leu Ala Asp Gin Gin Pro Leu Ser Lys Pro Lys Pro Val 

AAG TOG CCT TCA GTC AOC AAA OCA CCC AGT GGG ACT CCT CTC TCC OCT AOC CAA CTT OCA 840 
Lys Ser Pro Ser Val Thr Lys Pro Pro Ser Gly Thr Pro Leu Ser Pro Thr Gin Leu Pro 



CTA 
Leu 


GTC 
Val 


AAA 

Lys 


GGA 
Gly 


GOC TAT CAG GOG CTT AAC 
Ala Tyr Gin Ala Leu Asn 


900 


TGG 
Trp 


CTG 
Leu 


TGC 
Cys 


TEA 

Leu 


GTG TOG GGA CCT CCT TAT 
Val Ser Gly Pro Pro Tyr 


960 


ACC 
Thr 


AAT 
Asn 


CAT 
His 


TCC 
Ser 


ACC GCT CCG GOC AAC TGT 
Thr Ala Pro Ala Asn Cys 


1020 


TCT 

Ser 


GAA 
Glu 


GTG 
Val 


ACA 
Thr 


GGA CAG GGC CTA TGC ATG 
Gly Gin Gly Leu Cys Met 


1080 


TGT 
Cys 


AAC 
Asn 


AOC 
Tnr 


ACC 
Tnr 


CAA AGC GCC GGC TCA GGA 
Gin Ser Ala Gly Ser Gly 


1140 


ATG 
Met 


TGG 
Trp 


GCT 
Ala 


TGC 
Cys 


AGC ACT GGA TTG ACT CCC 
Ser Thr Gly Leu Thr Pro 


1200 
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PROMOdelH Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 34S 678 901 234 567 890 
TGC TTG TCC ACC ACG GTS CTC AAT CTA ACC ACA GAT TAT TGT GTA TTA GTT GAA CTC TGG 1260 
Cys Leu Ser The Thr Val Leu Asn teu Thr Thr Asp Tyr Cys Val Leu Val Glu Leu Trp 

OX AGA GTA ATP TAC CAC TOC COC GAT TAT ATG TAT GGT CAG CTT GAA CAG CGT ACC AAA 1320 
ProArgVallleTyrHisSerProAsp Tyr ifet Tyr Gly Gin Leu Glu Gin Arg Thr Lys 

TAT AAA AGA GAG CCA GTA TCA TTG ACC CTG GOC CTT CTA CTA OGA GGA TTA ACC ATG GGA 1380 
Tyr LyB Arg Glu Pro Val Ser Leu Thr Leu Ala I^u Leu Leu Gly Gly Leu Thr Met Gly 

(EG ATT GCA GCT GGA ATA GGG AOG Q33 ACC ACT QOC TTA ATT AAA ACC CAG CAG TTT GAG 1440 
cay lie Ala Ala Gly lie Gly Thr Gly Thr Thr Ala Leu He Lys Thr Gin Gin Phe Glu 

CAG CTT CAT GCC GCT ATC CAG ACA GAC CTC AAC GAA GTC GAA AAG TCA ATT ACC AAC CTA 1S00 
CUn Leu His Ala Ala lie Gin Thr Asp Leu Asn Glu Val Glu Lys Ser lie Thr Asn Leu 

GAA AAG TCA CTG ACC TCG TTG TCT GAA GTA GTC CTA CAG AAC CGC AGA GGC CTA GAT TTG 1560 
GCLtt Lys Ser Leu Thr Ser Leu Ser Glu Val Val Leu Gin Asn Arg Arg Gly Leu Asp Lea 

CTA TTC CTA AAG GBG GGA GGT CTC TGC GCA GOC CTA AAA GAA GAA TGT TGT TTT TAT GCA 1620 
leu Fne Leu Lys Glu Gly Gly Leu Cys Ala Ala Leu' Lys Glu Glu Cys Cys Phe Tyr Ala 

GRC CAC ACG GOG CTA GTC AGA GAC AGC ATG GOC AAA TEA AGA GAA AGG CTT AAT CAG AGA 1680 
Asp His Thr Gly Leu Val Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn Gin Arg 

CAA AAA CTA TTT GAG ACA Q3C CAA GSA TGG TTC GAA G3G CTC TTT AAT AGA TOC CCC TGG 1740 
Gin Lys Leu Phe Glu Tar Gly Gin Gly ttp Phe Glu Gly Leu Phe Asn Arg Ser Pro Trp 

^^^^^^^^^^^^^^^^CTOATCTTACTC 1800 
Phe Thr Thr Leu lie Ser Thr He Met Gly Pro Leu lie Val Leu Leu Leu He Leu Leu 

r^^^^ OTm ^ mOTMmOTmGfiCffiG MX:TCACmGIC I860 
Eta Gly Pro Cys lie Leu Asn Arg Leu Val Gin Phe Val lys Asp Arg lie Ser Val Val 

CAG GCT TTA GTC CTG ACT CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG OCA 1517 
Gin Ala Leu Val Leu Thr Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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GALVdelH Sequence 



10 20 30 40 50 60 

123 456 789 012 34 5 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

ATG GTA TTG CTG CCT QQG TOC ATG CTT CTC ACC TCA AAC CTG CAC CAC CTT COG CAC CAG 60 
Met Val Leu Leu Pro Gly Ser Met Leu Leu Tfar Ser Asn Leu His His Leu Arg His Gin 

ATG AGT OCT GGG AGC TOG AAA AGA CTG ATC ATC CTC TTA AGC TGC GTA TTC GGC GGC GGC 120 
Met Ser Pro Gly Ser Trp Lys Arg Leu lie lie -Leu Leu Ser Cys Val Phe Gly Gly Gly 

GGG AOG AGT CTG CAG AAT AAG AAC COC CAG CCC ATG ACC CTC ACT TOG CAG GTA. CTG TCC 180 
Gly Thr Ser Leu Gin Asn Lys Asn Pro Gin Pro Met Thr Leu Thr Trp Gl& Val Leu Ser 

CAA ACT GGA GAC GTT GTC TGG GAT ACA AAG GCA GTC CAG CCC CCT TOG ACT TOG TGG CCC 240 
Gin Thr Gly Asp Val Val Trp Asp Thr Lys Ala Val Gin Pro Pro Trp Thr Trp Trp Pro 

ACA CTT AAA CCT GAT GTA TGT GCC TTG GOG GOT AGT CTT GAG TOC TGG GAT ATC COG GGA 300 
Thr Leu Lys pro Asp Val Cys Ala Leu Ala Ala Ser Leu Glu Ser Trp Asp lie Pro Gly 

ACC GAT GTC TOG TOC TCP AAA OGA GTC AGA OCT COG GAC TCA GAC TAT ACT GCC OCT TM 360 
Thr Asp Val Ser Ser Ser Lys Arg Val Arg Pro Pro Asp Ser Asp Tyr Thr Ala Ala Tyr 

AAG CAA ATC ACC TOG GGA GCC ATA GGG TGC AGC TAC OCT COG GOT AGO ACT AGA ATG GCA 420 
Lys Gin lie Thr Trp Gly Ala lie Gly Cys Ser Tyr Pro Arg Ala Arg Thr Arg Met Ala 

AGC TCT ACC TTC TAC GTA TGT CCC GGG GAT GGC GGG ACC CTT TCA GAA GCT AGA AGG TGC 480 
Ser Ser Thr Phe Tyr Val Cys Pro Arg Asp Gly Arg Thr Leu Ser Glu Ala Arg Arg Cys 

GGG GOG CCA GAA TOC CCA TAC TGT AAA GAA TGG GAT TGT GAG ACC ACQ GOG . ACC OCT TAT 540 
Gly Gly Leu Glu Ser Leu Tyr Cys lys Glu Trp Asp Cys Glu Thr Thr Gly Thr Gly Tyr 

TGG CIA. TCT AAA TOC TCA AAA GAC CTC ATA ACT GTA AAA TGG GAC CAA AAT AGC GAA TGG 600 
Trp Leu Ser Lys Ser Ser Lys Asp Leu He Thr Val Lys Trp Asp Gin Asn Ser Glu Trp 

ACT CAA AAA TTT CAA CAG TGT CAC CAG ACC GGC TGG TCT AAC COC CTT AAA ATA GAT TTC 660 
Thr Gin Lys Phe Glu Gin cys His Gin Thr Gly Trp Cys Asn Pro Leu Lys lie Asp Phe 

ACA GAC AAA GGA AAA TTA TOC AAG GAC TGG ATA AOG GGA AAA ACC TGG GGA TTA AGA TTC 720 
Thr Asp Lys Gly Lys Leu Ser Lys Asp Trp He Thr Gly lys Thr Trp Gly Leu Arg Phe 

TAT GTG TCT GGA CAT OCA GGC GTA CAG TTC ACC ATT OGC TTA AAA ATC ACC AAC ATG OCA 780. 
Tyr Val Ser Gly His Pro Gly Val Gin Phe Thr He Arg Leu Lys He Thr Asn Met Pro 

GCT GTC GCA GTA GGT CCT GAC CTC GTC CTT GTG GAA CAA GGA OCT OCT AGA AOG TOC CTC 840 
Ala Val Ala Val Gly Pro Asp Leu Val Leu Val Glu Gin Gly Pro Pro Arg Thr Ser Leu 

GCT CTC OCA OCT OCT CTT CCC OCA AGG GAA GOG OCA COG OCA TCT CTC COC GAC TCT AAC 900 
Ala Leu Pro Pro Pro Leu Pro Pro Arg Glu* Ala Pro Pro Pro Ser Leu Pro Asp Ser Asn 

TOC ACA GCC CTG GOG ACT AGT GCA CAA ACT* CCC AGG GTG AGA AAA ACA ATT GIT ACC CTA 960 
Ser Thr Ala Leu Ala Thr Ser Ala Gin Thr Pro Thr Val Arg Lys Thr He Val Thr Leu 

AAC ACT OCG OCT CCC ACC ACA GGC GAC AGA CTT TTT GAT CTT GTG CAG GGG GOC TTC CTA 1020 
Asn Thr Pro Pro Pro Thr Thr Gly Asp Arg Leu Phe Asp Leu Val Gin Gly Ala Phe Leu 

ACC TTA AAT GCT ACC AAC CCA GGG GOC ACT GAG TCT TGC TGG CTT TGT TTG GOC ATG GGC 1080 
Thr Leu Asn Ala Thr Asn Pro Gly Ala Thr Glu Ser Cys Trp Leu Cys Leu Ala Met Gly 

CCC CCT TAT TAT GAA GCA ATA GCC TCA TCA GGA GAG GTC GCC TAC TCC ACC GAC CTT GAC 1140 
Pro Pro Tyr Tyr Glu Ala He Ala Ser Ser Gly Glu Val Ala Tyr Ser Thr Asp Leu Asp 

CGG TGC OGC TGG GGG AOC CAA GGA AAG CTC ACC CTC ACT GAG GTC TCA GGA CAC GGG TTG 1200 
Arg Cys Arg Trp Gly Thr Gin Gly Lys Leu Thr Leu Thr Glu Val Ser Gly His Gly Leu 
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GALVdelH Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 567 890 

TGC ATA GGA AAG GTG OOC TTT AOC CAT CAG CAT CTC TGC AAT CAG ACC CCA TQC ATC AAT 1260 
Cys lie Gly Lys Val Pro Ehe Thr His Gin His Leu Cys Asn Gin Thr Leu Ser lie Asn 

TCC TOC GGA GAC CAT CAG TAT CTG CTC COC TOC AAC CAT ADC TOG TGG GC£ TGC AGC ACT 1320 
Ser Ser Gly Asp His Gin Tyr Leu Leu Pro Ser Asn His Ser Trp Trp Ala^Cys Ser Thr 

GGC CTC ACC CCT TGC CTC TCC ACC TCA GIT TTT AAT CAG ACT AGA GAT TTC TOP ATC CAG 1380 
Gly Leu Thr Pro Cys Leu Ser Thr Ser Val Ehe Asn Gin Thr Arg Asp Pha Cys lie Gin 

GTC CAG CTG ATT CCT OOC ATC TAT TAC TAT. CCT GAA GAA GTT TTG TEA CAG GCC TAT GAC 1440 
Val Gin Leu He Pro Arg lie Tyr Tyr Tyr Pro Glu Glu Val Leu Leu Gin Ala Tyr Asp 

AAT TCT CAC COC AGG ACT AAA AGA GAG GCT GTC TCA CTT ACC CTA GCT GTT TEA CTG GGG 1500 
Asn Ser His Pro Arg Thr lys Arg Glu Ala Val Ser Leu Thr Leu Ala Val Leu Leu Gly 

TTG GGA ATC AOG GOG GGA ATA GCT ACT GCT TCA ACT GCC TEA ATT AAA GGA OCT ATA GAC 1560 
Leu Gly lie Thr Ala Gly lie Gly Thr Gly Ser Thr Ala Leu He Lys Gly Pro He Asp 

CTC CAG GAA GGC CTG ACA AGC CTC CAG ATC GCC AXA GAT GCT GAC CTC OGG GOC CTC CAA 1620 
Leu Gin Gin Gly Leu Thr Ser Leu Gin He Ala He Asp Ala Asp Leu Arg Ala Leu Gin 

GAC TCA GTC AGC AAG TTA GAG GAC TCA CTG ACT TOC CTG TOC GAG GTA GTG CTC CAA AAT 1680 
Asp Ser Val Ser lys Leu Glu Asp Ser Leu Thr Ser Leu Ser Glu Val Val Leu Gin Asn 

AGG AGA GGC CTT GAC TTG CTG TTT CTA AAA GAA GCT GGC CTC TCT GOG GOC CTA AAG GAA 1740 
Arg Arg Gly Leu Asp Leu Leu Ehe Leu Lys Glu Gly Gly Leu Cys Ala Ala Leu Lys Glu 

GAC TGC TCT TTT TAC ATA OC CAC TCA GCT GCA CTA OGG GAC TOC ATG AAA AAA CTC AAA 1800 
Glu Cys Cys Phe Tyr He Asp His Ser Gly Ala Val Arg Asp Ser Ifet Lys Lys Leu Lys 

GAA AAA CTG GAT AAA AGA CAG TTA GAG OGC CAG AAA AGC CAA AAC TGG TAT GAA GGA TGG 1860 
Glu Lys Leu Asp Lys Arg Gin Leu Glu Arg Gin lys Ser Gin Asn Trp Tyr Glu Gly Trp 

TTC AAT AAC TOC OCT TOG TTC ACT ACC CTG CTA TCA ACC ATC GCT GGG OOC CTA TTA CTC 1920 
Phe Asn Asn Ser Pro Trp Ehe Thr Thr Leu Leu Ser Thr He Ala Gly Pro Leu Leu Leu 

CTC CTT CTG TTG CTC ATC CTC GGG CCA TGC ATC ATC AAT OGA TTA GTT CAA TTT GTT AAA 1980 
Leu Leu Leu Leu Leu He Leu Gly Pro Cys He He Asn Arg Leu Val Gin Ehe Val Lys 

GAC AGG ATC TCA GTA GTC CAG GCT TTA GTC CTG ACT CAA CAA TAC CAC CAG CTA AAG OCT 2040 
Asp Arg He Ser Val Val Gin Ala Leu Val Leu Thr Gin Gin Tyr His Gin Leu Lys Pro 

ATA GAG TAC GAG OCA 2055 
He Glu Tyr Glu Pro 
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C2delH Sequence 



10 20 30 40 50 60 

123 456 789 012 345 678 901 234 567 890 123 456 789 012 345 678 901 234 S67 890 

ATG GOG OGT TCA ACG CTC TCA AAA COC OCT CAA GAT AAG ATT AAC COG TGG AAG CCC TTA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys He Asn Pro Trp Lys Pro Leu 

ATk GTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGO COC CAG GTC TTT AAT GTA 120 
He Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACC TOG AGA GTC ADC AAC CTG ATG ACT GGG OGT ACC GOC AAT GCC AOC TOf CTC CTG GGA 180 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Tfar Ala Asn Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GOC TTC OCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 240 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Ifce Asp Leu Cys Asp Leu Val Gly Glu Glu 

TOG GAC OCT TCA GAC CAG GAA COG TAT GTC GGG TAT GOC TGC AAG TAC OOC GCA GGG AGA 300 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly cys Lys Tyr Pro Ala Gly Arg 

CAG OGG ACC GGG ACT TTT GAC TTT TAC GTG TGC OCT GGG CAT AOC GTA AAG TOG GGG TGT 360 
Gin Arg Tbr Arg dr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys 

GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TGG GGG TGT GAA AOC ACC GGA CAG GCT TAC 420 
Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TGG AAG COC ACA TCA TOG TGG GAC CTA ATC TCC CTT AAG CGC GGT AAC ACC COC TGG GAC 480 
Trp Lys Pro Thr . Ser Ser Trp Asp Leu He Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACQ GGA TGC TCT AAA GTT GOC TGT GGC COC TGC TAC GAC CTC TOC AAA GTA TOC AAT TOC 540 
Thr Gly Cys Ser lys Val Ala Cys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn Ser 

TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC OCT CIA GTC CTA GAA TTC ACT GAT GCA 600 
Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu She Thr Asp Ala 

GGA AAA AAG GCT AAC TGG GAC GGG GOC AAA TGG TGG GGA CTG AGA CTG TAC OGG ACA GGA 660 
Gly. Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT OCT ATT AOC ATG TTC TOC CTC ACC COG CAG GTC CTT AAT GTG GGA COC CGA GTC 720 
Thr Asp Pro lie Thr Met Hie Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val 

CCC ATA GGG COC AAC CCA GTA TTA COC GAC CAA AGA CTC OCT TOC TCA CCA ATA GAG ATT 780- 
Pro He Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro He Glu He 

GTA COG GCT OCA CAG CCA OCT AGO COC CTC AAT ACC AGP TAC COC OCT TOC ACT ACC AGT 840 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr Ttor Ser 

ACA CCC CCC OCA GGT ACC GGA GAT AGA CTA CTA GCT CTA GTC AAA GGA GCC TAT CAG GOG 900 
Thr Pro Pro Pro Gly Thr Gly Asp Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr Gin Ala 

CTT AAC CTC ACC AAT COC GAC AAG AOC CAA GAA TGT TGG CTG TGC TTA GTG TOG GGA OCT 960 
Leu Asn Leu Thr Asn Pro Asp Lys Thr Gin Glu Cys Trp Leu Cys Leu Val Ser Gly Pro 

CCT TAT TAC GAA GGA GTA GOG GTC GTG GGC ACT TAT ACC AAT CAT TCC ACC GCT COG GOC 1020 
Pro Tyr Tyr Glu Gly Val Ala Val Val Gly Thr Tyr Thr Asn His Ser Thr Ala Pro Ala 

AAC TGT ACG GOC ACT TCC CAA CAT AAG CTT AOC CTA TCT GAA GIG ACA GGA CAG GGC CTA 1080 
Asn Cys Thr Ala Thr Ser Gin His Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu 

TGC ATG GGG GCA GTA CCT AAA ACT CAC CAG GOC TEA TGT AAC AOC ACC CAA AGC GCC GGC 1140 
Cys Met Gly Ala Val Pro Lys Thr His Gin Ala Leu Cys Asn Thr Thr Gin Ser Ala Gly 

TCA GGA TCC TAC TAC CTT GCA GCA COC GCC GGA ACA ATG TGG GCT TGC AGC ACT GGA TTG 1200 
Ser Gly Ser Tyr Tyr Leu Ala Ala Pro Ala Gly Thr Met Trp Ala Cys Ser Thr Gly Leu 
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ACT CCC TCC TTG TCC ACC ACG GTG CTC AAT CEA ADC ACA GAT TAT TGT GTA TTA GTT GAA 
Tbr Pro Cys Leu Ser Tbr Thr Val Leu Asn Leu Thr Thr Asp Tyr Cys \fcl Leu Val. Glu 



1260 



CTC TOG CCC AGA GTA ATT TAC CAC TCC CCC GAT TAT ATG TAT GGT GAG CTT GAA GAG OGT 1320 
Leu Trp Pro Arg Val lie Tyr His Ser Pro Asp Tyr Met Tyr Gly Gin Leu Glu Gin Arg 

ACC AAA TAT AAA AGA GAG CCA GTA TCA TTG AOC JCTG GCC CTT CTA CTA GGA GGA TEA ACC 1380 
Thr Lys Tyr Lys Arg Glu Pro Val Ser Leu Tbr Leu Ala Leu Leu Leu Gly Gly Leu Thr 



ATG GGA GGG ATT OCA GCT GGA ATA GOG AOG GOG 
Met Gly Gly He Ala Ala Gly lie Gly Thr Gly 



AOC ACT GCC TEA ATT AAA AOC GAG CAG 
Thr Thr Ala Leu He Lys Thr -Gin Gin 



1440 



TTT GAG CAG CTT GAT GCC GCT ATC CAG ACA GAC 
She Glu Gin Leu Bis Ala Ala lie Gin Thr Asp 



CTC AAC GAA GTC GZA AAG TCA ATT AOC 
Leu Asn Glu Val Glu Lys Ser lie Thr 



1500 



AAC CTA GAA AAG TCA CTG AOC TOG TTG TCT GAA 
Asn Leu Glu Lys . Ser Leu Thr Ser Leu Ser Glu 



GTA GTC CTA CAG AAC OGC AGA GGC CTA 
Val Val Leu Gin Asn Arg Arg Gly Leu 



1560 



GAT TTG CTA TTC CTA AAG GAG GGA GGT CTC TOC 
Asp Leu Leu Efca Leu Lys Glu Gly Gly Leu Cys 



GCA GCC CTA AAA GAA GAA TGT TGT TTT 
Ala Ala Leu Lys Glu Glu Cys Cys Phe 



1620 



EAT GCA GAC CAC AOG GGG CIA GIG AGA GAC AOC 
ryr Ala Asp His Thr Gly Leu Val Arg Asp Ser 



ATG GCC AAA TTA AGA GAA AGG CTT AAT 
Met Ala lys Leu Arg Glu Arg Leu Asn 



1680 



CAG AGA GAA AAA CTA TTT GAG ACA GGC GAA GGA 
Gin Arg Gin Lys Leu Phe Glu Thr Gly Gin Gly 



TOG TTC GAA GGG CTG TTT AAT AGA TOC 
Trp Phe Glu Gly Leu Phe Asn Arg Ser 



1740 



CCC TOG TTT AOC AOC TTA ATC TOC AOC ATC ATG 
Pro Trp Phe Thr Thr Leu He Ser Thr He Met 



GGA OCT CTA ATA GTA CTC TTA CTG ATC 
Gly Pro Leu lie Val Leu Leu Leu lie 



1800 



TTA CTC TTT GGA OCT TOC ATT CTC AAT OGA TTG 
leu Leu Che Gly Pro Cys He Leu Asn Arg Leu 



GTC GAA TTT GTT AAA GAC AGG ATC TCA 
Val Gin Phe Val Lys Asp Arg lie Ser 



1860 



GTS GTC CAG GCT CTG GTT TTG ACT CAG CAA TAT 
Val Val Gin Ala Leu Val Leu Thr Gin Gin Tyr 



CAC CAG CTA AAA CCC AXA GAG TAC GAG 
His Gin Leu Lys Pro He Glu Tyr Glu 



1920 



OCA TGA 
Pro , . « 



1926 
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AdelHRIess Sequence 



PCI7EP00/04534 



123 


10 

456 789 012 


345 


678 


20 
901 


234 


567 


30 40 50 60 
890 123 456 789 012 345 678 901 234 567 890 




ATG 
Met 


GOG CGT 
Ala Arg 


TCA 
Ser 


ACG 
Bur 


CTC 
Leu 


TCA 
Ser 


AAA 

Lys 


CCC 
Pro 


CCT CAA GAT AAG ATT AAC COG TGG AAG CCC TTA 
Pro Gin Asp Lys lie Asn Pro Trp Lys Pro Leu 


60 


ATA 
Xle 


GTC ATC 
Val Met 


GGA 
Gly 


GTC 

val 


CTG 
Leu 


TEA 
Leu 


GGA 
Gly 


GTA 
Val 


GOG ATG OCA GAG AGC COC CAG GTC TTT AAT GTA 
Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 


120 


ACC 
Thr 


TGG AGA 
Trp Arg 


GTC 
Val 


ACC 
Thr 


AAC 
Asn 


CTG 
Leu 


ATG 
Met 


ACT 
Thr 


GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG GGA 
Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly 


180 


ACT 
Thr 


GTA CAA 
Val Gin 


GAT 
Asp 


GCC 
Ala 


TTC 
Phe 


CCA 
Pro 


AAA 
Lys 


TTA 
Leu 


TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 
Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 


240 


TOG 
Tip 


GAC OCT 
Asp Pro 


TCA 
Ser 


GAC 
Asp 


CAG 
Gin 


GAA 
Glu 


COG 
Pro 


TAT 
T"yr 


GTC GGG TAT GGC TQC AAG TAC COC GCA GGG AGA 
Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 


300 


CAS 
Gin 


OGG ACC 
Arg Thr 


COG 
Arg 


ACT 
Thr 


TTT 
Phe 


GAC 
Asp 


TTT 
Phe 


TAC 
Tyr 


GTG TQC OCT GGG CAT ACC GTA AAG TOG GGG TGT 
Val Cys Pro Gly His Thr Val Lys Ser Gly Cys 


360 


OG8 
Gly 


GGA OCA 
Gly Pro 


GGA 
Gly 


GAG 
Glu 


GGC TAC 
Gly Tyr 


TGT 
Cys 


GGT 
Gly 


AAA TGG GGG TGT GAA ACC ACC GGA CAG GCT TAC 
Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 


420 


TOG 
Tip 


AAG CCC 
Lys Pro 


ACA 
Thr 


TCA 
Ser 


TOG 
Ser 


TGG 
Trp 


GAC 
Asp 


CTA 
Leu 


ATC TCC CTT AAG CGC GGT AAC ACC CCQ TGG GAC 
lie Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 


480 


ACQ 
Thr 


GGA TQC 
Gly Cys 


TCT 
Ser 


AAA 

lys 


GTT 
Val 


GCC 

Ala 


TGT 

cys 


GGC 
Gly 


COC TQC TAC GAC CTC TCC AAA GTA TCC AAT TCC 
Pro cys Tyr Asp Leu Ser lys Val Ser Asn Ser 


540 


TTC 
Phe 


CAA GGG 
Gin Gly 


QCT 

Ala 


ACT 
Thr 


CGA 
Arg 


GGG 
Gly 


GGC 
Gly 


AGA 
Arg 


TQC AAC CCT CTA CTC CTA GAA TTC ACT GAT GCA 
Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 


600 


GG\ 
Gly 


AAA AAG 

Lys Lys 


GCT 

Ala 


AAC 
Asn 


TGG 
Trp 


GAC 
Asp 


GGG 
Gly 


CCC 
Pro 


AAA TOG TGG GGA CTG AGA CTG TAC COG ACA GGA 
Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 


660 


AO. 
The 


GAT OCT 
Asp Pro 


ATT 

lie 


ACC 
Thr 


ATG 
Met 


TTC 
She 


TCC 
Ser 


CTG 
Leu 


ADC COG CAG GTC CTT AAT GTG GGA COC CGA GTC 
Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val 


720 


CCC 
Pro 


ATA GOG 
lie Gly 


CCC 
Pro 


AAC 
Asn 


OCA 
Pro 


GTA 
Val 


TTA 
Leu 


CCC 
Pro 


GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT 
Asp Gin Arg Leu Pro Ser Ser Pro He Glu He 


780 


GTA 
Val 


COG OCT 
Pro Ala 


CCA 
Pro 


CAG 
Gin 


CCA 
Pro 


OCT 
Pro 


AGC 
Ser 


COC 
Pro 


CTC AAT ACC AGT TAC CCC CCT TCC ACT ACC ACT 
Leu Asn Thr Ser Tyr Pro Pro Ser Thr Thr Ser 


840 


AC* 
Thr 


COC TGA 
Pro Ser 


ACC 
Thr 


TCC 
Ser 


OCT 
Pro 


ACA 

Thr 


ACT 
Ser 


CCA 
Pro 


AGT GTC CCA CAG OCA COC OCA GGA ACT GGA GAT 
Ser Val Pro Gin Pro Pro Pro Gly Thr Gly Asp 


900 


AGX 
Arg 


OTA CTA 
Leu Leu 


GCT 
Ala 


CTA 
Leu 


GTC 
Val 


AAA 
Lys 


GGA 
Gly 


GCC 
Ala 


TAT CAG GOG CTT AAC CTC ACC AAT CCC GAC AAG 
Tyr Gin Ala Leu Asn Leu Thr Asn Pro Asp Lys 


960 


ACC 
Thr 


CAA GAA 
Gin Glu 


TGT 
Cys 


TGG 
Trp 


CTG 
Leu 


TQC 
cys 


TTA 
Leu 


GTG 
Val 


TOG GGA OCT CCT TAT TAC GAA GGA GTA GOG GTC 
Ser Gly Pro Pro Tyr Tyr Glu Gly Val Ala Val 


1020 


GTG 
Val 


GGC ACT 
Gly Hit 


TAT 
Tyr 


ACC 
Thr 


AAT 
Asn 


CAT 
His 


TCC 
Ser 


ACC 
Thr 


GCT COG GCC AAC TGT ACG GCC ACT TCC CAA CAT 
Ala Pro Ala Asn Cys Thr Ala Thr Ser Gin His 


1080 


AAG 
Lys 


CTT ACC 
Leu Thr 


CTA 
Leu 


TCT 
Ser 


GAA 
Glu 


GTG 
Val 


ACA 
Thr 


GGA 
Gly 


CAG GGC CTA TQC ATG GGG GCA GTA CCT AAA ACT 
Gin Gly Leu Cys Met Gly Ala Val Pro Lys Thr 


1140 


CAC 
His 


CAG GCC 
Gin Ala 


TEA 
Leu 


TGT 
Cys 


AAC 
Asn 


ACC 
Thr 


ACC 
Thr 


CAA 
Gin 


AGC GOC GGC TCA GGA TOC TAC TAC CTT GCA GCA 
Ser Ala Gly Ser Gly Ser Tyr Tyr Leu Ala Ala 


1200 
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CCC GCC OGA ACA ATG TGG OCT TGC AGC ACT GGA TIG ACT CCC TGC TTG TCC.ACC ACG GTG 1260 
Pro Ala Gly Thr Met Trp Ala Cys Ser Thr Gly Leu Thr Pro Cys Leu Servitor Thr Val 



CTC AAT CTA AGC ACA GAT TAT TOP GTA TTA GTT GAA CTC TOG COC AGA GTA ACT TAC CAC 1320 
Leu Asn Leu Thr Thr Asp Tyr Cys Val Leu Val Glu Leu Trp Pro Arg Val lie Tyr His 

TCC COC GAT TAT ATG TAT GOT GAG CTT GAA CAG OGT ACC AAA TAT AAA AGA GAG CCA GTA 1380 
Ser Pro Asp Tyr Met Tyr Gly Gin Leu Glu Gin Arg Thr Lys Tyr Lys Arg Glu Pro Val 

TCA TTG ACC CTG GCC CTT CTA CTA GGA GGA TTA ACC. ATG GGA GOG AIT GCA OCT GGA ATA 1440 
Ser Leu Thr Leu Ala Leu Leu Lea Gly Gly Leu Thr Met Gly Gly lie Ala Ala Gly lie 

GGG ACG GGG ACC ACT GCC TTA ATT AAA ACC CAG CAS TTT GAG CAG CTT CAT GCC GCT ATC 1500 
Gly Thr Gly Thr Thr Ala Leu lie Lys Thr Gin Gin Phe Glu Gin Leu His Ala Ala He 

CAG ACA GAC CTC AAC GAA GTC GAA AAG TCA ATT ACC AAC CTA GAA AAG TCA CTG ACC TOG 1560 
Gin Thr Asp Leu Asn Glu Val Glu Lys Ser He Thr Asn Leu Glu Lys Ser Leu Thr Ser 

TTG TCT GAA GTA GTC CIA CAG AAC GGC AGA GGC CTA GAT TTG CTA TIC CTA AAG GAG GGA 1620 
Leu Ser Glu Val val Leu Gin Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly 

GOT CTC TGC GCA GCC CTA AAA GAA GAA TGT TCT TTT TAT GCA GAC CAC ACG GGG CTA GTG 1680 
Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys cys Phe Tyr Ala Asp His Thr Gly Leu Val 

AGA GAC AGC ATG GCC AAA TTA AGA GAA AGG CTT AAT CAG AGA CAA AAA CTA TTT GAG ACA 1740 
Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu Thr 

GGC CAA GGA TGG TTC GAA GGG CTG TTT AAT AGA TCC COC TGG TTT ACC ACC TTA ATC TCC 1800 
Gly Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser Pro Trp Phe Thr Thr Leu He Ser 

ACC ATC ATG GGA OCT CTA ATA CTA CTC TTA CTG ATC TTA CTC TTT GGA OCT TGC ATT CTC 1860 ' 
Thr He Met Gly Pro Leu He Val Leu Leu Leu He Leu Leu Phe Gly Pro Cys He Leu 

AAT CGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA OTA GTC CAG GCT TTA 1911 
Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu 
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ATG 006 CGT TCA ACG CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC OCG TOG AAG CCC TEA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys lie Asn Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTC TEA GGA GTA OGG ATG OCA GAG AGO COC CAG GTC TTT AAT GTA 120 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACC TOG AGA GTC ADC AAC CTG ATG ACT GOG CCT ACC GOC AAT GOC ACC TCC CTC CTG GGA 180 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asn Ala Thr Ser 'Leu Leu Gly 

ACT GTA CAA GAT GOC TTC CCA AAA TEA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 240 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TOG GAC CCT TCA GAC CAG GAA COG TAT GTC GGG TAT GOC TGC AAG TAG CQC GCA GOG AGA 300 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG OGG ACC COO ACT TTT GAC TTT TAG GTG TGC CCT GGG CAT ACC GTA AAG TOG GOG TGT 360 
Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys 

GOG GGA OCA GGA GAG GOC TAC TGT GOT AAA TOO GOG TGT GAA ACC ACC GGA CAG OCT TAG 420 
Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TOG AAG CCC ACA TCA TOO TOO QAC CEA ATC TCC CTT AAG GOC GGT AAC ACC CCC TOG GAC 480 
Trp Lys Pro Thr Ser Ser Trp Asp Leu He Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACQ GGA TGC TCT AAA GTT GOC TGT GOC GOC TGC TAC GAC CTC TCC AAA CTA TCC AAT TCC 540 
Thr Gly Cys Ser lys Val Ala Cyu Gly Pro Cys Tyr Asp Leu Ser lys Val Ser Asn Ser 

TTC CAA GGG OCT ACT OGA GGG GOC AGA TGC AAC OCT CTA GTC CTA GAA TTC ACT GAT OCA 600 
Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGA AAA AAG OCT AAC TOO GAC GGG CCC AAA TOO TOG GGA CTG AGA CTG TAC OOO ACA GGA 660 
Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC OGG CAG GTC CTT AAT GIG GGA CCC OGA GTC 720 
Thr Asp Pro He Thr Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg Val 

CCC AXA GGG CCC AAC GOC GTT CTG GCA GAC CAA GAG OCA CTC TCC AAG CCC AAA OCT GTT 780 ; 
Pro lie Gly Pro Asn Pro Val Leu Ala Asp Gin Gin Pro Leu Ser lys Pro Lys Pro Val 

AAG TOG OCT TCA QIC ACC AAA OCA CCC AGT GGG ACT CCT CTC TCC OCT ACC CAA CTT OCA 840 
Lys Ser Pro Ser Val Thr Lys Pro Pro Ser Gly Thr Pro Leu Ser Pro Thr Gin Leu Pro 

COG GOO GGT ACC GGA GAT AGA CTA CTA GCT CTA GTC AAA GGA GOC TAT CAG GOG CTT AAC 900 
Pro Ala Gly Thr Gly Asp Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr Gin Ala Leu Asn 

CTC ACC AAT CCC GAC AAG ACC CAA GAA TGT TGG CTG TGC TEA GTG TOG GGA CCT CCT TAT 960 
Leu Thr Asn Pro Asp lys Thr Gin Glu Cys Trp Leu Cys Leu Val Ser Gly Pro Pro Tyr 

TAC GAA GGA GTA GOG GTC GTG GOC ACT TAT ACC AAT CAT TCC ACC GCT COG GCC AAC TGT 1020 
Tyr Glu Gly Val Ala Val Val Gly Thr Tyr Tfcr Asn His Ser Thr Ala Pro Ala Asn Cys 

ACG GCC ACT TCC CAA CAT AAG CTT ACC CTA TCT GAA GTG ACA GGA CAG GGC CTA TGC ATG 1080 
Thr Ala Thr Ser Gin His Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu Cys Met 

GGG GCA GTA CCT AAA ACT CAC CAG GCC TTA TGT AAC ACC ACC CAA AGC GOC GGC TCA GGA 1140 
Gly Ala Val Pro Lys Thr His Gin Ala Leu Cys Asn Thr Thr Gin Ser Ala Gly Ser Gly 

TCC TAC TAC CTT GCA GCA CCC GCC GGA ACA ATG TGG GCT TGC AGC ACT GGA TTG ACT CCC 1200 
Ser Tyr Tyr Leu Ala Ala Pro Ala Gly Thr Met Trp Ala Cys Ser Thr Gly Leu Thr Pro 
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TOC 
Cys 


TTG 
Leu 


TOC 
Ser 


AOC 
Thr 


ACG 
Thr 


GTG 
Val 


CTC 
Leu 


AAT 
Asn 


CTA 
Leu 


ACC ACA 
Thr Thr 


GAT 
Asp 


TAT 

Tyr 


TGT GTA TTA GTT GAA CTC TGG 
Cys Val Leu Val Glu Leu Trp 


1260 


ccc 

Pro 


AGA 
Arg 


GTA 
Val 


ATT 
He 


TAG 
Tyr 


CAC 
His 


TOC 
Ser 


CCC 
Pro 


GAT 
Asp 


TAT ATG 
Tyr Met 


TAT 
Tyr 


GOT 
Gly 


CAG CTT GAA CAG OCT ACC AAA 
Gin Leu Glu Gin Arg Thr Lys 


1320 


HAT 
Tyr 


AAA 

Lys 


AGA 
Arg 


GAG 
GlU 


CCA 
Pro 


GTA 
Val 


TCA 

Ser 


TTG 
Leu 


ACC 
Thr 


CTG GOC 
Leu Ala 


CTT 
Leu 


CTA 
Leu 


CTA GGA GGA TTA AOC ATG GGA 
Leu Gly Gly Leu Thr Met Gly 


1380 


GGG 

Gly 


ATT 
lie 


OCA 
Ala. 


GCT 
Ala 


GGA 

mm 

G±y 


ATA 
■Lie 


GOG 
Gly 


AOG 


GGG 
Gly 


ACC ACT 


GOC 

AiA 


TTA 
Leu 


ATT AAA AOC CAG CAG TTT GAG 
xxe iys XDi Gin ud irne wu 


1440 


C2G 
Gin 


CTT 
Leu 


CAT 
His 


GOC 
Ala 


GCT 
Ala 


ATC 
lie 


GAG 
Gin 


ACA 
Tar 


GAC 
Asp 


CTC AAC 
Leu Asn 


GAA 

Glu 


GTC 
Val 


GAA AAG TCA ATT AOC AAC CTA 
Glu Lys Ser He Thr Asn Leu 


1500 


GAA 
Glu 


AAG 

Lys. 


TCA 
Ser 


CTG 
Leu 


ACC 
Thr 


TOG 
Ser 


TTG 
Leu 


TCT 
Ser 


GAA 
Glu 


GTA GTC 
Val Val 


CTA 
Leu 


GAG 
Gin 


AAC CGC AGA GGC CTA GAT TTG 
Asn Arg Arg Gly Leu Asp Leu 


1560 


CTA 
Lei 


1TC 
Phe 


CTA 
Leu 


AAG 
Lys 


GAG 
Glu 


GGA 
Gly 


GGT 
Gly 


CTC 
Leu 


TOC 

cys 


GCA'GOC 
Ala Ala 


CTA 
Leu 


AAA 
lys 


GAA GAA TGT TGT TTT TAT GCA 
Glu Glu cys Cys Ehe Tyr Ala 


1620 


GAC 
Asp 


CAC 
His 


ACG 
Thr 


GGG 
Gly 


CTA 
Leu 


OTG 
Val 


AGA 
Arg 


GAC 
Asp 


AGC 
Ser 


ATG GOC 
Met Ala 


AAA 
lys 


TTA 
Leu 


AGA GAA AGG CTT AAT CAG AGA 
Arg Glu Arg Leu Asn Gin Arg 


1680 


CM 
Gin 


AAA 

Lys 


CTA. 
Leu 


TTT 
Phe 


GAG 
Glu 


ACA 
Thr 


GGC 
Gly 


CAA 
Gin 


GGA 
Gly 


TOG TIC 
Trp She 


GAA 
Glu 


GOG 
Gly 


CTG TTT AAT AGA TOC CCC TGG 
Leu She Asn Arg Ser Pro Trp 


1740 


TTT 
Phe 


AOC 
Thr 


ACC 
Thr 


TTA 
Leu 


ATC 
lie 


TOC 
Ser 


AOC 
Thr 


ATC 

lie 


ATG 
Met 


GGA OCT 
Gly Pro 


CTA 
Leu 


ATA 
tie 


GTA CTC TTA CTG ATC TTA CTC 
Val Leu Leu Leu He Leu Leu 


1800 


TTT 
Phe 


GGA 
Gly 


OCT 
Pro 


TGC 

cys 


AIT 
lie 


CTC 
Leu 


AAT 
Ash 


OGA 
Arg 


TEA 
Leu 


GTT CAA 
Val Gin 


TTT 
She 


GTT 
Val 


AAA GAC AGG ATC TCA GTA GTC 
lys Asp Arg He Ser Val Val 


1860 



CAG GCT TTA 1869 
Gin Ala Leu 
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ATG GTA TTG CTG CCT GOG TCC ATG CTT CTC AOC TCA AAC CTG CAC CAC CTT COG CAC CAG 60 
Met Val Leu Leu Pro Gly Ser Met Leu Leu Thr Ser Asn Leu His His Leu Arg His Gin 

ATG AGT OCT GGG U3C TGG AAA AGA CTG ATC ATC CTC TTA AGC TGC GTA TTC GGC GGC GGC 120 
Met Ser Pro Gly Ser Trp Lys Arg Leu lie lie Leu Leu Ser Cys Val Phe Gly Gly Gly 

GGG ACG AGT CTG CAG AAT AAG AAC CCC CAG COC ATG AOC CTC ACT TGG CAffiCTA CTG TOC 180 
Gly Thr Ser Leu Gin Asn Lys Asn Pro Gin Pro Met Thr Leu thr Trp Gin Val Leu Ser 

CAA ACT GGA GAC GTT GTC TGG GAT ACA AAG GCA GTC CAG CCC CCT TGG ACT TGG TGG. COC 240 
Gin Thr Gly Asp Val Val Trp Asp Thr Lys Ala Val Gin Pro Pro Trp Thr Trp Trp Pro 

ACA CTT AAA OCT GAT GTA TGT GGC TTG GOG GCT AGT CTT GAG TOC TGG GAT ATC COG GGA 300 
Thr Leu Lys Pro Asp Val cys Ala Leu Ala Ala Ser Leu Glu Ser Trp Asp He Pro Gly 

AOC GAT GTC TOG TCC TCT AAA CGA GTC AGA OCT COG GAC TCA GAC TAT ACT GCC GCT TAT 360 
Thr Asp Val Ser Ser Ser Lys Arg Val Arg Pro Pro Asp Ser Asp Tyr Thr Ala Ala Tyr 

AAG CAA ATC AOC TGG GGA GCC ASA GGG TGC AGC TAG OCT OGG GCT AGS ACT AGA ATG GCA 420 
Lys Gin lie Thr Trp Gly Ala lie Gly Cys Ser Tyr Pro Arg Ala Arg Thr Arg Met Ala 



AGC TCT AOC TTC TAC GTA TGT COC OGG GAT GGC 
Ser Ser Thr Phe Tyr Val Cys Pro Arg Asp Gly 

GGG GGG CIA GAA TOC CTA TAC TGT AAA GAA TGG 
Gly Gly Leu Glu Ser Leu Tyr Cys lys Glu Trp 

TGG CTA TCT AAA TOC TCA AAA GAC CTC ATA ACT 
Trp Leu Ser Lys Ser Ser lys Asp Leu lie Thr 

ACT CAA AAA TTT CAA CAG TGT CAC CAG AOC GGC 
Thr Gin Lys Phe Gin Gin Cys His Gin Thr Gly 

ACA GAC AAA GGA AAA TTA TOC AAG GAC TGG ATA 
Thr Asp Lys Gly lys Leu Ser Lys Asp Trp lie 

TAT GT6 TCT GGA CAT OCA GGC GTA CAG TTC AOC 
Tyr val Ser Gly His Pro Gly Val Gin Phe Thr 

GCT GTG GCA GTA GGT OCT GAC CTC GTC CTT GTG 
Ala Val Ala Val Gly Pro Asp Leu Val Leu Val 

GCT CTC OCA OCT OCT CTT COC OCA AGG GAA GOG 
Ala Leu Pro Pro Pro Leu Pro Pro Arg Glu Ala. 

TCC ACA GCC CTG GOG ACT AGT GCA CAA ACT COC 
Ser Thr Ala Leu Ala Thr Ser Ala Gin Thr Pro 

AAC ACT CCG OCT COC AOC ACA GGC GAC AGA CTT 
Asn Thr Pro Pro Pro Thr Thr Gly Asp Arg Leu 

ACC TTA AAT GCT ACC AAC OCA GGG GCC ACT GAG 
Thr Leu Asn Ala Thr Asn Pro Gly Ala Thr Glu 

CCC CCT TAT TAT GAA GCA ATA GCC TCA TCA GGA 
Pro Pro Tyr Tyr Glu Ala He Ala Ser Ser Gly 

OGG TGC GGC TGG GGG AOC CAA GGA AAG CTC AOC 
Arg Cys Arg Trp Gly Thr Gin Gly Lys Leu Thr 



OGG 
Arg 


AOC 
Thr 


CTT 
Leu 


TCA GAA GCT AGA AGG TGC 
Ser Glu Ala Arg Arg Cys 


480 


GAT 
Asp 


TGT 
Cys 


GAG 
Glu 


AOC ACG GGG AOC GGT TAT 
Thr Thr Gly Thr Gly Tyr 


540 


GTA 
Val 


AAA 
I#s 


TGG 
Trp 


GAC CAA AAT AGC GAA TGG 
Asp Gin Asn Ser Glu Trp 


600 


TGG 
Trp 


TGT 

cys 


AAC 
Asn 


COC CTT AAA ATA GAT TTC 
Pro Leu Lys He Asp Phe 


660 


ACG 
Thr 


GGA 
Gly 


AAA 
Lys 


AOC TGG GGA TTA AGA TTC 
Thr Trp Gly Leu Arg Phe 


720 


ATT 
lie 


OGC 
Arg 


TTA 
Leu 


AAA ATC AOC AAC ATG OCA 
Lys He Thr Asn Met Pro 


780 : 


GAA 
Glu 


CAA 
Gin 


GGA 
Gly 


OCT OCT AGA AGG TOC CTC 
Pro Pro Arg Thr Ser Leu 


840 


OCA 
Pro 


COG 
Pro 


OCA 
Pro 


TCT CTC OCC GAC TCT AAC 
Ser Leu Pro Asp Ser Asn 


900 


ACG 
Thr 


GTG 
Val 


AGA 
Arg 


AAA ACA ATT GTT AOC CTA 
Lys Thr He Val Thr Leu 


960 


TTT 
Fhe 


GAT 
Asp 


CTT 
Leu 


GTG CAG GGG GCC TTC CTA 
Val Gin Gly Ala Phe Leu 


1020 


TCT 
Ser 


TGC 
Cys 


TGG 
Trp 


CTT TGT TTG GOC ATG GGC 
Leu Cys Leu Ala Met Gly 


1080 


GAG 
Glu 


GTC 
Val 


GCC 
Ala 


TAC TOC ACC GAC CTT GAC 
Tyr Ser Thr Asp Leu Asp 


1140 


CTC 
Leu 


ACT 
Thr 


GAG 
Glu 


GTC TCA GGA CAC GGG TTG 
Val Ser Gly His Gly Leu 


1200 
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TQC ATA GGA AAG GTS COC TIT ACC CAT CAG CAT CTC TGC AAT CAG ACC CTA TOC ATC AAT 1260 
Cys lie Gly Lys Val Pro Phe Thr His Gin His Leu Cys Asn Gin Thr Leu Ser He Asn 

TOC TOC GGA GAC CAT CAG TAT CTG CTC COC TCC AAC CAT AGC TGG TGG GC* >-TGC AGC ACT 1320 
Ser Ser Gly Asp His Gin Tyr Leu Leu Pro Ser Asn His Ser Trp Trp Ala Cys Ser Thr 

GGC CTC ACC OCT TGC CTC TOC ACC TCA GTT TTT AAT CAG ACT AGA GAT TIC TOT ATC CAG 1380 
Gly Leu Thr Pro Cys Leu Ser Thr Ser Val Phe Asn Gin Thr Arg Asp Phe Cys He Gin 

GTC CAG CTG ATT OCT OGC ATC TAT TAC TAT OCT GAA GAA GTT TTG TEA CAG GCC TAT GAC 1440 
Val Gin Leu He Pro Arg He Tyr Tyr Tyr Pro Glu Glu Val Leu Leu Gin Ala Tyr Asp 

AAT TCT CAC COC AGG ACT AAA AGA GAG OCT GTC TCA CTT ADC CTA OCT GTT TTA CTG GGG 1500 
Asa Ser His Pro Arg Thr Lys Arg Glu Ala Val Ser Leu Thr Leu Ala Val Leu Leu Gly 

TTG GGA ATC AGG GOG GGA ATA GOT ACT GGT TCA ACT GCC TTA. ATT AAA GGA OCT AXA GAC 1560 
Lea Gly He Hbr Ala Gly He Gly Thr Gly Ser Thr Ala Leu He Lys Gly Pro He Asp 

CTC CAG CAA GGC CTG ACA AGC CTC CAG ATC GCC ATA GAT OCT GAC CTC OGG GCC CTC GAA 1620 
Leu Gin Gin Gly Leu Thr Ser Leu Gin He Ala He Asp Ala Asp Leu Arg Ala Leu Gin 

GAC TCA GTC AGC AAG TTA GAG GAC TCA CTG ACT TOC CTG TOC GAG GTA GIG CTC CAA AAT 1680 
Asp Ser Val Ser Lys Leu Glu Asp Ser Leu Thr Ser Leu Ser Glu Val Val Leu Gin Asn 

AGS AGA' GGC CTT GAC TIG CTG TTT CTA AAA GAA GGT GGC CTC TCT GOG GCC CTA AAG GAA 1740 
.Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly Gly Leu Cys Ala Ala Leu Lys Glu 

GAG TGC TOT TTT TAC ATA GAC CAC TCA GGT GCA GTA OGG GAC TOC ATG AAA AAA CTC AAA 1800 
Gin Cys Cys Hie Tyr He Asp His Ser Gly Ala Val Arg Asp Ser Met Lys Lys Leu Lys 

GMl AAA CTG GAT AAA AGA CAG TTA GAG OGC CAG AAA AGC CAA AAC TGG TAT GAA GGA TGG 1860 
Glu Lys Leu Asp Lys Arg Gin Leu Glu Arg Gin Lys Ser Gin Asn Trp Tyr Glu Gly Trp 

TTC AAT AAC TOC OCT TGG TTC ACT ACC CTG CTA TCA ACC ATC GCT GGG OCC CTA TTA CTC 1920 
Phe Asn Asn Ser Pro Trp Phe Thr Thr Leu Leu Ser Thr He Ala Gly Pro Leu Leu Leu 

CTC CTT CTG TTG CTC ATC CTC GGG OCA TGC ATC ATC AAT AAG TTA GTT CAA TTC ATC AAT 1980 
Lea Leu Leu Leu Leu He Leu Gly Pro Cys He He Asn Lys Leu Val Gin Phe He Asn 

GAT AGG ATA AST GCA TCT 1998 
Asp Arg He Ser Ala Cys 
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ATS GOG CGT TCA ACG CTC TCA AAA CCC OCT CAA GAT AAG ATT AAC COG TOG AAG OCC TTA 60 
Met Ala Arg Ser ota Ser Lys Pro Pro Gin Asp Lys lie Asn Pro Trp Lys Pro Leu 

A1A GTC ATG GGA GTC CTG TTA GGA GTA GOG ATG GCA GAG AGC CCC CAG GTC TTT AAT GTA 120 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACT TOG AGA C7TC AOC AAC CTG ATG ACT GOG OGT ACC GOC AAT GOC ADC TOCjfTC CTG GGA 
ThrTrpAryValtoAsnLeuMettoCUyArgtoAlaAs^ 

ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 
to Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Qys Asp Leu Val Gly Glu Gin 

TOG GAC CCT TCA GAC CAG GAA COG TAT GTC GOG TAT GQC TGC AAG TAC CCC GCA GGG AGA 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG OGG ADC COG ACT TTT GAC TTT TAC GTG TGC CCT GGG O^T AOC GTA AAG TOG GOG TOT 
Gin Arg to Arg Oto Ete Asp Phe Tyr Val Cys Pro Gly His . to Val Lys Ser Gly Cys 

GGG GGA OCA GGA GAG GOC TAC TOP GGT AAA TOG GGG TGT GAA AOC AOC GGA CAG GCT TAC 
Gly Gly Pro Gly <£Lu Gly Tyr cys GOy lys Trp Gly Cys Olu to to Gly Gin Ala Tyr 

TGG AAG OCC ACA TCA TOG T3G GAC CTA AIC TCC CTT AAG OGC GGT AAC AOC CCC TGG GAC 480 
Trp Lys Pro to" Ser Ser Trp Asp Leu lie Ser Leu lys Arg Gly Asn to Pro Trp Asp 

AOG GGA TOC TCT AAA GTT GCC TGT GQC CCC TQC TAC GAC CTC TCC AAA GTA TCC AAT TCC 
toGlyCysSerlysValAlaCysCayProCysiyrAspl^SerlysVa^ 



180 



240 



300 



360 



420 



540 



600 



TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GXC CTA GAA TTC ACT GAT OCA 
Phe Gin Gly Ala to Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGA AAA AAG GCT AAC TOG GAC GGG CCC AAA TOG TGG GGA CTG AGA CTG TAC OGG ACA GGA 
Gly Lys Lys Ala Asn Trp Asp Gly ProLysSerTrpGlyLeuArgLeuTyr Arg to Gly 

ACA GAT OCT ATT AOC ATG TIC TCC CTG ADC OGG CAG GTC CTT AAT GTG GGA CCC CGA GTC 
to Asp Pro He to Het Phe Ser Leu to Arg Gin Val Leu Asn Val Gly Pro Arg Val 

OCC ATA GGG OOC AAC CCA GTA TTA OCC GAC CAA AGA CTC OCT TOC TCA OCA ATA GAG ATT 780. 
Pro lie Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro He Glu He 



660 



720 



GTA COG GCT OCA CAG OCA OCT AGC OCC CTC AAT ACC AGT TAC OOC CCT TCC ACT ACC AGT 
Val Pro Ala Pro GlnProProSerProLeuAsntoSerTyrPro Pro Ser to to Ser 



840 



ACA OCC OCC CCA GGT AOC GGA GAT AGA CTA CTA GCT CTA GTC AAA QGA GOC TAT CAG GCG 900 
to Pro Pro Pro Gly to Gly Asp Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr Gin Ala 



CTT AAC CTC ACC AAT OOC GAC AAG ACC CAA GAA TGT TGG CTG TGC TTA GTG TOG GGA OCT 
Leu Asn Leu Thr Asn Pro Asp Lys to Gin Glu Cys Trp Leu Cys Leu Val Ser Gly Pro 



960 



OCT TAT TAC. GAA GGA GTA GOG GTC. GTG GGC ACT TAT AOC AAT CAT TCC ACC GCT COG GOC 1020 
Pro Tyr tyr Glu Gly Val Ala Val Val Gly Thr Tyr to Asn His Ser to Ala Pro Ala 



AAC TGT AOG GCC ACT TCC CAA CAT AAG CTT ACC CTA TCT GAA GTG ACA GGA CAG GGC CTA 
Asn Cys to Ala to Ser Gin His lys Leu Thr Leu Ser Glu Val to Gly Gin Gly Leu 

TGC ATG GGG GCA GTA CCT AAA ACT CAC CAG GOC TTA TGT AAC AOC ACC CAA AGC GCC GGC 
Cys Met Gly .Ala Val Pro Lys to His Gin Ala Leu Cys Asn to to Gin Ser Ala Gly 

TCA GGA TOC TAC TAC CTT GCA GCA OOC GCC GGA ACA ATG TGG GCT TGC AGC ACT GGA TIG 
Ser Gly Ser Tyr Tyr Leu Ala Ala Pro Ala Gly to Met Trp Ala Cys Ser to Gly Leu 
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CIA 
Leu 


ACC 
Thr 


ACA 
Thr 


GAT 
Asp 


TAT 
Tyr 


TGT CTA TTA GTT GAA 
Cys Val Leu Val Glu 


1260 


GAT 
Asp 


TAT 

iyr 


ATG 
Met 


TAT 

Tyr 


GGT 
Gly 


CAG CTT*t3AA CAG OCT 
Gin Leu Glu Gin Azg 


1320 


ADC 
Thr 


CTG 
Leu 


GOC 
Ala 


CTT 
Leu 


CTA 
Leu 


CTA GGA GGA TTA ACC 
Leu Gly Gly Leu Thr 


1380 


GOG 
Gly 


ACC 


ACT 
ttsr 


GOC 
Ala 


TEA 

Leu 


ATT AAA ACC CAG CAG 
lie Lys Thr Gin Gin 


1440 


GftC 
Asp 


CTC 
Leu 


AAC 
Asn 


GAA 
Glu 


GTC 
Val 


GAA AAG TCA ATT ACC 
Glu Lys Ser lie ttsr 


1500 


GAA 
Glu 


CTA 

val 


GTC 
Val 


CTA 

Leu 


CAG 
Gin 


AAC CGC AGA GOC CIA 
Asn Arg Axg Gly Leu 


1560 


TQC 
Cys 


GCA 
Ala 


GOC 
Ala 


CTA 
Leu 


AAA 

Lys 


GAA GAA TGT TGT TTT 
Glu Glu Cys Cys Phe 


1620 


AGC 
Ser 


ATG 
Met 


GOC 
Ala 


AAA 

Lys 


TTA 
Leu 


AGA GAA AGG CTT AAT 
Arg Glu Arg Leu Asn 


1680 


GGA 
Gly 


TOG 
Trp 


TTC 
Phe 


GAA 
Glu 


GOG 
Gly 


CTG TTT AAT AGA TOC 
Leu Phe Asn Arg Ser 


1740 


ATG 
Met 


GGA 
Gly 


OCT 
Pro 


CTA 
Leu 


ATA 
tie 


GTA CTC TTA CTG ATC 
Val Leu Leu Leu lie 


1800 


TTG GTC 
Leu Val 


CAA 
nil 


TTT 
Phe 


GTT 
Val 


AAA GAC AGG ATC TCA 
Lys Asp Arg lie Ser 


1860 



GTG GTC CAG GOT CTG 1875 
Val Val Gin Ala Leu 
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ATG GCG CGT TCA ACQ CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC CCG TOG AAG CCC TEA 60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lya He Asn Pro Trp Lys Pro Leu 

■ 

ATA GTC ATG GGA GTC CTG OTA GGA GTA GGG ATG GCA GAG AGC CCC CAT CAG GTC TTT AAT20 
Lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro His Gin Val Phe Asn 

GTA ACC TOG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG 80 
Val Thx Trp Arg Val Thr Asn Leu kat Thr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu 

GGA ACT OTA CAA GAT GCC TTC CCA £AA TEA TAT TTT GAT CTA TCT GAT CTG GTC GGA GAG 40 
Gly Thr Val Gin Asp Ala Phe Pro bys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu 



bys 

pCG 



GAG TOG GAC CCT TCA GAC CAG GAA pCG TAT GTC GGG TAT GGC TGC AAG TAG CCC GCA GGG 00 
Glu Trp Asp Pro Ser Asp Gin Glu pro Tyr Val Gly Tyr Gly Cys Lya Tyr Pro Ala Gly 

AG* CAG CGG ACC CGG ACT TTT GAC TTT TAG GIG TGC CCT GGG CAT ACC GTA AAG TOG GGG 60 
Ar? Gin Arg Tnr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly 

TOT GGG GGA CCA GGA GAG GGC TAG TOT GOT AAA TGG GGG TOT GAA ACC ACC GGA CAG GC920 
Cya Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly cys Glu Thr Thr Gly Gin Ala 

TAC TCG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT AAG CGC GGT AAC ACC CCC TGG 80 
Tyr Trp Lys Pro Thr Ser Ser Trp Asp Leu He Ser Leu Lys Arg Gly Asn Thr Pro Trp 

GAC ACG GGA TGC TCT AAA CTT GCC TGT GGC CCC TGC TAC GAC CTC TCC AAA GTA TCC AA940 
Asp Thr Gly cys Ser Lys Val Ala Cys Gly Pro Cys Tyr Asp Leu 8er Lys Val Ser Asn 

I 

TCC TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAB 00 
Sex Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Gin Phe Tnr Asp 

GCA GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TCG TGG GGA CTG AGA CTG TAC CGG ACA 60 
Ala Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr 

GGA ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC CGG CAG GTC CTT AAT GIG GGA CCC CG220 
Gly Thr Asp Pro He Tnr Met Phe per Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg 

GTC CCC ATA GGG CCC AAC CCA GTA TTA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG 80 
Val Pro He Gly Pro Asn Pro Val pen Pro Asp Gin Arg Leu Pro Ser Ser Pro He Glu 

ATT GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC ACT TAC CCC CCT TCC ACT ACG40 
He Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr Tnr 

ACT ACA CCC TCA ACC TCC CCT ACA ACT CCA ACT GTC CCA CAG CCA CCC CCA GCT ACC GAA 00 
Ser Thr Pro Ser Thr Ser Pro Thr Sex Pro Ser Val Pro Gin Pro Pro Pro Gly Thr Glu 

AAT AGG CTG CTA AAC TTA GTA GAC GGA GCC TAC CAA GCC CTC AAC CTC ACC ACT CCT GAG 60 
Asn Arg Leu Leu Asn Leu Val Asp Gly Ala Tyr Gin Ala Leu Asn Leu Thr Ser Pro Asp 

AAA ACC CAA GAG TCC TGG'TTG TCT CTA GTA GCG GGA CCC CCC TAC TAC GAA GGG GTT GOJD20 
Lys Thr Gin Glu Cys Trp Leu Cys Leu Val Ala Gly Pro Pro Tyr Tyr Glu Gly Val Ala 

i 

GTC CTG GGT ACC TAC TCC AAC CAT ACC TCT GCT CCA GCC AAC TGC TCC GTG GCC TCC CAS 80 
Val Leu Gly Thr Tyr Ser Asn His Thr Ser Ala Pro Ala Asn Cys Ser Val Ala Ser Gin 

CAC AAG TTG ACC CTG TCC GAA GTG ACC GGA CAG GGA CTC TGC ATA GGA GCA GTT CCC AAA 40 
Bis tya iieu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu Cys He Gly Ala Val Pro Lys 



ACA CAT CAG GCC CTA TGT AAT ACC ACC CAG ACA AGC ACT CGA GGG TCC TAT TAT CTA GTS 00 
Thr His Gin Ala Leu Cys Asn Thr par Gin Thr Ser Ser Arg Gly Ser Tyr Tyr Leu Val 
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GCC CCT ACA GOT ACC ATG TGG OCT TOT AOT ACC GGG CTT ACT CCA TGC ATC TCC ACC AOJ360 
Ala Pro Thr Gly Thr Met Trp Ala Cys Ser Thr Gly Leu Thr Pro Cys He 8er Thr Thr 

ATA CTG AAC CTT ACC ACT GAT TAT TGT GTT CTT GTC GAA CTC TGG CCA AGA GTC ACC TSB20 
He Lea Asn Leu Thr Thr Asp Tyr Cya Val Lett Val Glu Leu Trp Pro Arg Val Thr Tyr 



CAT TCC CCC AGC TAT GOT TAC GGC CTG TOP 
His Bar Pro Ser Tyr Val. Tyr Gly Leu Phe 

GTQ TCG TTA ACC CTG GCC CTA TEA TTG GOT 
Val Ser Leu Thr Leu Ala Leu Leu Leu Gly 

ATA GGG ACG GGG ACC ACT GCC 
lie Gly Thr Gly Thr Thr Ala 

ATC CAG ACA GAC CTC AAC GAA GTC GAA AAG 
He Gin Thr Asp Leu Asa Glu Val Glu Lys 

TCG TTG TCT GAA CTA GTC CTA CAG ^AC CGC 
Ser Leu Ser Glu Val Val Leu Glu Aan Arg 

GGA GGT CTC TGC GCA GCC CTA AAA joAA GAA 
Gly Gly Leu Cys Ala Ala Leu Lys plu Glu 

GIG AGA GAC AGC ATG GCC AAA TTA AGA GAA 
Val Arg Asp Ser Mat: Ala Lys Xeu Arg Glu 

ACA GGC CAA GGA TGG TTC GAA GGG CTG TTT 
Thr Gly Gin Gly Trp Phe Glu Gly Leu Phe 

TCC ACC ATC ATG GGA CCT CTA ATA GTA CTC 
Ser Thr He Met Gly Pro Leu lie Val Leu 

CTC AAT CGA TTA GTT CAA TTT OTT AAA GAC 
Leu Asn Arg Leu Val Gin Phe Val Lys Asp 

ACT CAA CAA TAC CAC CAG CTA AAA CCA CTA 
Thr mn Gin Tyr His Gin Leu Lys Pro Leu 



GAG AGA TCC AAC CGA CAC AAA AGA GAA COG 80 
Glu Arg Ser Asn Arg His Lys Arg Glu Pro 

GGA CTA ACC ATG GGA GGG ATT GCA OCT GGA 40 
Gly Leu Thr Mat Gly Gly lie Ala Ala Gly 

CAG CAG TTT GAG CAG CTT CAT GCC GOT 00 
Gin Gin Phe Glu Gin Leu His Ala Ala 

TCA ATT ACC AAC CTA GAA AAG TCA CTG AO06O 
Ser lie Thr Asn Leu Glu Lys Ser Leu Thr 

AGA GGC CTA GAT TTG CTA TTC CTA AAG G&/820 
Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu 

TCT TGT TTT TAT GCA GAC CAC ACG GGG CTX80 
Cys Cys Phe Tyr Ala Asp His Thr Gly Leu 

AGG CTT AAT CAG AGA CAA AAA CTA TTT GAG 40 
Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu 

AAT AGA TCC CCC TGG TTT ACC ACC TTA AT0OO 
Asn Arg Ser Pro Trp Phe Thr Thr Leu lie 

TTA CTG ATC CTA CTC TTT GGA CCT TGC ATB60 
Leu Leu He Leu Leu Phe Gly Pro Cys He 

AGG ATC TCA GTA GTC CAG GCT TTA GTC CTO20 
Arg lie Ser Val Val Gin Ala Leu Val Leu 

GAA TAC GAG CCA 1962 

Glu Tyr Glu Pro 



TTA ATT AAA ACC 
Leu He Lys Thr 
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ATG.GCG COT TCA ACQ CTC TCA AAA ^CC CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TTA60 
Met Ala Arg 8er Thr Leu sex Lya pro Pro Gin Asp Lys He Aan Pro Trp Lys Pro Leu 

ATA GTC ATG GGA GTC CTG TTA GGA OTA GGG ATG GCA GAG AGC CCC CAG CTC TTT AAT GTA20 
lie Vol Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Aan Val 

ACC TGG AGA GTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG GGA 80 
Thr Trp Arg Val Thr Aan Leu Met Thr Gly Arg Thr Ala Aan Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TOT GAT CTG GTC GGA GAG GAO40 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC CCT TCA GAC CAG GAA CCG SAT GTC GGG TAT GGC TGC AAG TAG CCC GCA GGG AGS00 ' 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC OTA AAG TCG GGG TOT 60 
Gin Arg Thr Arg Thr Phe Asp Phe .Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys 

GGG GGA CCA GGA GAG GGC TAC TOT GOT AAA TGG GGG TOT GAA ACC ACC GGA CAG GCT TAS20 
Gly Gly Pro Gly Glu Gly Tyr Cys ply Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TGG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT AAG CGC GOT AAC ACC CCC TGG GAS 80 
Trp Lys Pro Thr Ser Ser Trp Asp peu lie Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACG GGA TGC TCT AAA CTT GCC TOT |3GC CCC TGC TAC GAC CTC TCC AAA CTA TCC AAT TCG 40 
Thr Gly Cys Ser Lys Val Ala Cys Gly Pro Cys Tyr Aap Leu Ser Lys Val Ser Asn Ser 

TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GCJfiOO 
Phe Gin Gly Ala Thr . Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TCG TGG GGA CtfG AGA CTG TAC CGG ACA GGS60 
Gly Lya Lys Ala Aan Trp Asp Gly Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC CGG CAG OTC CTT AAT GTG GGA CCC CGA GTJ20 
Thr Aap Pro lie Thr Met .Phe Ser Leu Thr Arg Gin Val Leu Aan Val Gly Pro Arg Val 

CCC ATA GGG CCC AAC CCA GTA TEA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT 80 . 
Pro lie Gly Pro Asn Pro Val Leu jpro Asp Gin Arg Leu Pro Ser Ser Pro lie Glu He 

CTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC ACT TAC CCC CCT TCC ACT ACC ACT40 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr Thr Ser 

ACA CCC TCA ACC TCC CCT ACA AGT CCA ACT GTC CCA CAG CCA CCC CCA GOT ACC GAA AAV00 
Thr Pro ser Thr Ser Pro Thr 8er ^ro Ser Val Pro Gin Pro Pro Pro Gly Thr Glu Asn 

AGG CTG CTA AAC TTA OTA GAC GGA fcCC TAC CAA GCC CTC AAC CTC ACC AGT CCT GAC AAA 60 
Arg Leu Leu Asn Leu Val Asp Gly ^la Tyr Gin Ala Leu Asn Leu Thr Ser Pro Asp Lys 

ACC CAA GAG TGC TGG TTG TOT CTA bTA GCG GGA CCC CCC TAC TAC GAA GGG GTT GCC GTO20 
Thr Gin Glu Cys Trp Leu Cyo Leu j/al Ala Gly Pro Pro Tyr Tyr Glu Gly Val Ala Val 

CTG GCT ACC TAC TCC AAC CAT ACC TCT GCT CCA GCC AAC TGC TCC GTG GCC TCC CAA CAB80 
Leu Gly Thr Tyr Ser Asn His Thr Ser Ala Pro Ala Asn Cys Ser Val Ala Ser Gin His 

AAG TT& ACC CTG TCC GAA GTG ACC GGA CAG GGA CTC TGC A*EA GGA GCA GTT CCC AAA AOA40 
Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu cys He Gly Ala Val Pro Lys Thr 

CAT CAG GCC CTA TOT AAT ACC ACC CAG ACA AGC ACT CGA GGG TCC TAT TAT CTA GTT GOG 00 
His Gin Ala Leu Cys Asn Thr Thr Gin Thr Ser Ser Arg Gly Ser Tyr Tyr Leu Val Ala 
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CCT ACA GOT ACC ATG TOG OCT TOT AGT ACC GGG CTT ACT CCA TGC ATC TCC ACC ACC ATS 60* 
Pro Thr Gly Thr Met Trp Ala Cys Ser Thr Gly Leu Thr Pro Cys lie Ser Thr Thr lie 

CTG AAC CTT ACC ACT GAT TAT TGT |3TT COT GTC GAA CTC TGG CCA AGA GTC ACC TAT CAS 20 
Leu Asa Leu Thr Thr Asp Tyr Cys yal Leu Val Glu Leu Trp Pro Arg Val Thr Tyr His 

TCC CCC AGC TAT OTT TAG GGC CTG [TTT GAG AGA TCC AAC CGA CAC AM AGA GAA CCG GTO80 
Ser Pro Ser Tyr Val Tyr Gly Leu phe Glu Arg Ser Asn Arg His Lys Arg Glu Pro Val 

TCG TTA ACC CTG GCC CTA TEA TTG |3GT GGA CTA ACC ATG GGA GGG ATT OCA GCT GGA AM 40 
Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Wet Gly Gly lie Ala Ala Gly lie 

GGG ACG GGG ACC ACT GCC TTA ATT AAA ACC GAG CAG TTT GAG CAG CtT CAT GCC GCT AT600 
Gly Thr Gly Thr Thr Ala Leu lie Lys Thr Glu Gin Phe Glu Gin Leu' Bis Ala Ala lie 

CAG ACA GAC CTC AAC GAA CTC GAA AAG TCA ATT ACC AAC CTA GAA AAG TCA CTG ACC TCDS60 
flin fljr Asp Leu Asn Glu Val Glu Lys Ser lie Thr Asn Leu Glu Lys Ser Leu Thr Ser 

TTG TCT GAA CTA GTC CTA. CAG AAC CGC AGA GGC CTA GAT TTG CTA TIC CTA AAG GAG GOJ620 
Leu Ser Glu Val Val Leu Gin Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly 

GGT CTC TGC GCA GCC CTA AAA GAA GAA TGT TGT TTT TAT GGA GAC CAC ACG GGG CTA CM 80 
Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe Tyr Ala Asp His Thr Gly Leu Val 

AGA GAC AGC ATG GCC AAA TCA AGA GAA AGG CTT AAT CAG AGA CAA AAA CTA TTT GAG AOS40 
Arg Asp Ser Mot Ala £ys Leu Arg Glu Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu Thr 

GGC CAA GGA TGG TIC GAA GGG CTG TTT AAT AGA TCC CCC TGG TTT ACC ACC TTA ATC TCKS00 
Gly Gin Gly Trp Phe Glu Gly Leu phe Asn Arg Ser Pro Trp Phe Thr Thr' Leu lie Ser 

ACC ATC ATG GGA CCT CTA ATA GTA fcTC TTA CTG ATC TTA CTC TTT GGA CCT TGC ATT CT860 . 
Thr lie Met Gly Pro Leu He Val pen Leu Leu He Leu Leu Phe Gly Pro Cys He Leu 

AAT CGA TTA GTT CAA TTT OTT AAA pAO AGG ATC TCA OTA GTC CAG GCT TTA CTC CTG A&920 
Asn Arg Leu Val Gin Phe Val Lys £sp Arg He Ser Val Val Gin Ala Leii Val Leu Thr 

CAA CAA TAG CAC CAG CTA AAA CCA CTA GAA TAG GAG CCA 1959 
Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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ATO GCG CGT TCA ACG CTC TCA AAA \XC CCT CAA GAT AAG ATT AAC CCG TGG AAG CCC TXA60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Pro Gin Asp Lys Ho Asa Pro Trp Lys Pro Leu 

ATA GTC ATC GGA GTC CTC TTA GGA JGTA GGG ATG GCA GAG AGC CCC CAT CAG CTC TTT AAT20- 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Hid Gin Val Phe Aon 

GT* ACC TGG AGA CTC ACC AAC CTG KTG ACT GGG CGT ACC GCC AAT GCC ACC TCG CTC CTG 80 
val Thr Trp Arg Val Thr Asn Leu \tot Thr Gly Arg Thr Ala AsnJVlavafcr ^r/ fteu Leq.- ' 



GGA ACT GTA CAA GAT GCC TTC CCA KAA TEA TAT TTT GAT CTA TGT GAT CTG CTC GGA GAG 40 
Gly Thr Val Gin Asp Ala Phe Pro pys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu 

GAG TCG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAC CCC GCA GGG 00 
Glu Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly 

AGA CAG CGG ACC CGG ACT TTT GAC TTT TAG GTG TGC CCT GGG CAT ACC GTA AAG 'TCG GGG 60 
Arg Gin Arg Thr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly 

TOT GGG GGA CCA GGA GAG GGC TAG TGT QOT AAA, TG?. : GP9 TGT GAA ACC ACC GGA CAG GCM0 
cys Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Trp Gly Cys Glu Thr Thr Gly Gin Ala 

TAC TGG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT AAG CGC GGT AAC ACC QCC TGG 80 
Tyr Trp Lys Pro Thr Ser Ser Trp Asp Leu lie Ser Leu *ys Arg Gly Asn ttr Pro Tro 

GAC ACG GGA TGC TCT AAA GTP GCC iCGT GGC CCC TGC TAC GAC CTC TCC AAA GTA TCC AAJ49 
Asp Thr Gly cys ser Lys Val Ala bys Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn 



TCC TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAB 00 
sex Phe Gin Gly Ala Thr Arg Gly tily Arg Cys Asn Pro Leu Val Leu Glu Phe Ofcr Asp 

GCA GGA AAA AAG GCT AAC TOO GAC boG CCC AAA TCG TGG GGA C$Q AGA CTG T£Q QGQ 4£$£0 

[siy 



GGA ACA GAT CCT ETT ACC ATG TTC |PCC CTG ACC CGG CAG GTC pCT ^T CTG §GA £CC C£g20 

per Leu Thr Arg Gin Val Leu -Asn Val Gly Pro &rg 



Ala Gly Lys Lys Ala Asn Trp Asp ply Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr 

GGA ACA GAT CCT ATT ACC ATG TTC 
Gly Thr Asp Pro lie Thr Met Phe 

GTC CCC ATA GGG CCC AAC CCA CTA |n» CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG 80 
Val Pro He Gly Pro Asn Pro Val ken Pro Asn Gin Arg Leu Pro Ser Ser £ro lie Q}u 

ATT GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC Atf* TAC fCJ CCT TOJ ^Cgf? 
He Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro ser Thr Thr 



ACT ACA CCC TCA ACC TCC CCT ACA ACT CCA ACT GTC CCA CAG CCA CCC CCA GGA ACT 
ser Ofcr Pro Ser Thr Ser Pro Thr 8er Pro Ser Val Pro Gin Pro Pro Pro Q^y ^hr Gly 

GAT AGA CTA CTA GCT CTA GTC AAA GGA GCC TAT CAG GCG CTT AAC CTC ACC AAJ CCC GA06O 
Asp Arg Leu Leu Ala Leu Val Lys Gly Ala Tyr Gin Ala Leu Asn Leu Thr Asn Pro Asp 

AAG ACC CAA GAA TGT TGG. CTG TGC fTEA GTG TCG GGA CCT CCT TAT TAC GAA GGA CTA GO® 20' 
Lys Ohr Gin Glu Cys Trp Leu Cys £eu Val Ser Gly Pro Pro Tyr Tyr Glu Gly Val Ala 

GTC GTG GGC ACT TAT ACC AAT CAT TCC ACC GCT CCG GCC AAC TGT ACG GCC ACT TCC CA»80 
Val val Gly Thr Tyr Thr Asn His jper Thr Ala Pro .Ala Asn Cys Thr Ala Tfcr Ser Gin 

CAT AAG CTT ACC CTA TCT GAA GTG ACA GGA CAG GGC CTA TGC ATG GGG GCA GTA CCT AA$.40 

{ihr ' ~ ~ '" ~ " ~ 



His Lys Leu Thr Leu Ser Glu Val phr Gly Gin Gly Leu Cys Met Gly Ala Val Pro Lys 

ACT CAC CAG GCC TTA TCT AAC ACC hCC CAA AGC GCC GGC TCA GGA TCC TAC TAC CTT GQ2J 
Thr Bis Gin Ala Leu Cys Asn Thr Jlhr Gin Ser Ala Gly Ser Gly Ser Tyr Jyr Leu Ala 
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GCA. CCC GCC GGA ACA ATG TGG GCT TGC AGC ACT GGA TTQ ACT CCC TGC TTQ TCC ACC AUG 60 
Ala Pro Ala Gly Thr Met Trp Ala Cys Ser Thr Gly lieu Thi Pro Cys Leu ser Thr Thr 

GTG CTC AAT CTA ACC ACA GAT TAT TGT GTA TEA GTT GAA CTC TGG CCC AGA GTA ACT TBO20 
Val Leu Asn Leu Thr Thr Aap Tyr Cys Val Leu Val Glu Leu Trp Pro Arg Val He Tyr 

CAC TCC CCC GAT TAT ATG TAT GGT CAG CTT GAA GAG CGT ACC AAA TAT AAA AGA GAG COS SO 
His Ser Pro Asp Tyr Mat Tyr Gly Gin Leu Glu Gin Arg Thr Lys Tyr Lys Arg , gin pro 

GT& TCA TTQ ACC CT3 GCC CTT CTA CTA GGA GGA TEA ACC ATG GGG GGA ATT GCC OCT GGA 40 
Val Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly Gly He Al a Ala Gly 

ATA GGA ACA GGG ACT ACT GCT CTA ATG GCC ACT CAG GAA TTC CAG CAG. CTC CAA GCC GO500 
He Gly Thr Gly Thr Thr Ala Leu Met Ala Thr Gin Gin Phe Gin Gin Leu Gin Ala Ala 

GTA CAG GAT GAT CTC AGG GAG GTT pAA AAA TCA ATC TCT AAC CTA GAA AAG'TCT'CTC AO360 
Val Gin Asp Asp Leu Arg Glu Val plu Lys Ser He Ser Asn Leu Glu Lys Ser Leu Thr 

TCC CT3 TCT GAA GTT GTC CTA CAG j&AT. C£A AQG GGC. <JTA GAC TT9 TTA TTT CTA -AAA GAB 20 
Ser Leu Ser Glu Val Val Leu Gin Asn Arg Arg Gly' Leu Asp Leu Leu Phe Leu Lys Glu 

GGA GGG CTG TGT GCT GCT CTA AAA GAA GAA TCT TGC TTC TAT GCG GAC CAC ATA GGA CT B 80 
Gly Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys ' Phe . Tyr Ala Asp Pis Thr Gly Leg - 

GTG AGA GAC AGC ATG GCC AAA TTG AGA GAG AGG CTT AAT CAG AGA CAG^AAA CTQ^VTT GA04O- 
Val Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn Q^i Arg «ln Lys Leu ?}ie GjLu " 

TCA ACT CAA GGA TSG TTT GAG GGA CTG TTT AAC AGA TCC CCT TGG ^TT ACC ACC TTG AIM 00 
Ser Thr Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser Pro Trp Phe Thr Thr Leu He 

TCT ACC ATT ATG GGA CCC 'CTC ACT CTA CTC CTA ATG ATT TOG C^f GGA £CQ ^gC ATPSP 

Ser Hhr He Met Gly Pro Leu He Val Leu Leu Met He Leu Leu Phe Gly Pro Cys He 

CTT AAT CGA TTA GIT CAA TTT OTT AAA GAC AGG ATC TCA GTA GTC CAG GCT TTA 'CTC C3O20 
Leu Asn Arg Leu Val Gin Phe Val Lys Asp Arg Ha Ser Val Val Gin Ala Leu Val Leu" 

ACT CAA CAA TAG CAC CAG CTA AAA .CCA CTA GAA TAC GAG CCA 1962. 
Thr Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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ATG GCG CGT TCA ACQ CTC TCA AAA 
Met Ala Arg Ser Thr Lea Ser Lys 


;CC CCT GAA GAT AAG ATT AAC GCG TGG AAG CCC TXA60 
?ro Gin Asp Lys He Asn Pro Trp Lys Pro Leu 



ATI GTC ATG GGA GTC CTG TEA GGA CTA GGG ATG GGA GAG AGC CCC CAG GTC TOT AAT GZ&20 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 



ACC TGG AGA CTC ACC AAC CTG ATG ACT GGG CGT ACC GCC AAT GCC ACC TCC CTC CTG GG&S0 
Thr Trp Arg Val Thr Asn Leu Met Thr Gly Arg Thr Ala Asa Ala Thr Ser Leu Leu Gly 

ACT GTA CAA GAT GCC TTC CCA AAA TTA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 40 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAG CCC GGA GGG AGA 00 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG CGG ACC CGG ACT TTT GAC TTT TAG CTG TGC CCT GGG CAT ACC GTA AAG TGG GGG TOT 60 
Gin Arg Thr Arg Thr Phe Asp Phe Tyr val Cys Pro Gly His Tfcr Val Lys Ser Gly Cys 

dGG GGA CCA GGA GAG GGC TAC TGT GCT AAA TGG GGG TGT GAA ACC ACC GGA CAG GCT TAS20 
Gly Gly Pro Gly Glu Gly Tyr Cys laly Lys Trp Gly cys Glu Tfer Tfer Gly Gin Ala Tyr 

TGG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT AAG CGC GOT AAC ACC GCC TGG GA480 
Trp Lys Pro Tfcr Ser Ser Trp Asp |&eu He Ser Leu Lya Arg Gly Asn tfir Pro "Trp Asp 

ACG GGA TGC TCT AAA GTT GCC TGT feGC CCC TGC TAC GAC CTC TCC AAA GTA TCC "AAT v TCC 40 
Thr Gly cys Ser Lys Val Ala Cys |3ly Pro Cys Tyr Asp Leu ser *ys Val Ser Asn Ser 

TTC CAA GGG GCT ACT CGA GGG GGC XGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GOS00 
Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu phe Thr Asp Al^ ' 

GGA AAA AAG GCT AAC TGG GAC GGG CCC AAA TCG TGG GGA CTG AGA CTQ TAC CGG ACA GGA 60 
Gly Lys Lys Ala Asn Trp Asp Gly Pro Lys Ser Trp Gly Leu Arjj j&eu ^r Artj Tfer Gjjy 

ACA GAT CCT ATT ACC ATG TTC TCC CTG ACC CGG CAG CTC CTT AAT GTG GGA CCC CGA GTC7?0 
Thr Asp Pro lie Thr Met Phe Ser Leu Thr Arg Gin Val Leu Asn Val Gly p£> Arg Val ' 

CCC ATA GGG CCC AAC CCA OTA TTA CCC GAC CAA AGA CTC CCT TCC TCA CGA ATA GAG ATT 80 
Pro lie Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro Ha Glu He ' 

GTA CGG GCT CCA CAG CCA CCT AGC hcc CTC AAT ACC ACT TAC CCC QCT JCC &r? £CC AG?4Q 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr. Thr Ser 

ACA CCC TCA ACC TCC CCT ACA AGT CGA ACT GTC CCA CAG CCA CCC CCA GGA ACT GGA GA900 
Thr Pro ser Thr Ser Pro Thr Ser pro Ser Val Pro Gin Pro Pro Pro Gly Tto Gly Asp 

AGA CTA CTA GCT CTA GTC AAA GGA bcc TAT CAG GCG CTT AAC CTC ACC AAT CCC GAC AAG 60 
Arg Leu Leu Ala Leu Val Lys Gly ^la Tyr Gin Ala Leu Asn Leu Thr Asn Pro Asp Lys 

' *■ • i • ; \. \ « ■ s ;». i4 ^ 

ACC CAA GAA TGT TGG CTG TGC TTA bTS TCG GGA CCT CCT TAT TAC GAA GGA OTA GCG (7BD20 
Thr Gin Glu Cys Trp Leu Cys Leu pal Ser Gly Pro Pro Tyr Tyr Glu Gly Val Ala Val' 

GTG GGC ACT TAT ACC AAT CAT TCC ACC GCT CCG GCC AAC TCT ACG GCC ACT TCC CAA CAS 80 
Val Gly Tfcr Tyr Thr Asn His Ber Thr Ala Pro Ala Asn Cys Tfcr Ala Thr Ser Gin His 

AAG COT ACC CTA TCT GAA CTG ACA GGA CAG GGC CTA TGC ATG GGG GCA GTA CCT AAA AO* 40 
Lys Leu Thr Leu Ser Glu Val Thr Gly Gin Gly Leu cys Met Gly Ala Val Pro Lys Thr 

CAC GAG GCC TTA TCT AAC ACC ACC CAA AGC GCC GGC TCA GGA TCC TAC TAC CTT GCA GO300 
His Gin Ala Leu Cys Asn Thr Thr Gin Ser Ala Gly Ser Gly Ser Tfrr fyr £eu Ala Ala 
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CCC GCC GGA ACA ATO TOG GCT TGC AGC ACT GGA TTQ ACT CCC TGC TTQ TCC ACC ACQ GXQ60 
Pro Ala Gly Thr Met Trp Ala Cys -Ser Thr Gly Leu Iftr Pro cys Leu Ser ahr Thr Val 

CTC AAT CTA ACC ACA GAT TAT TGT GTA TTA GTT GAA CTC TGG CCC AGA GTA ATT TAC 00020 
Leu Abu Leu Thr Thr Asp Tyr Cys yaX Leu Val Glu Leu Trp Pro Arg Val He ^yr His 

TCC CCC GAT TAT ATG TAT GGT CAG fcTT GAA CAG CGT ACC AAA TAT AM A£A Q&p-CCA GX9?Q 
Ser Pro Asp Tyr Met Tyr Gly Gin |teu Glu Glu Arg Thr Lys Tyr Lys Arg Glu^Pzo Val * 

TCA TTQ ACC CTG GCC CTT CTA CTA GGA GGA TTA ACC ATO GGG GGA AST' GCC : GCT" GGA A7a4D 
ser Leu Tfcr Leu Ala Leu Leu Leu Gly Gly Leu Thr Met Gly Gly lie Ala Ala Gly He 

GGA ACA GGG ACT ACT GCT CTA ATO GCC ACT CAG CAA TTC CAG CAG CTC CAA GCC QCA GTS 00 
Gly Thr Gly Thr Thr Ala Leu Met Ala Thr Gin Gin Phe flin Qlj Leu Gin Ala Ala Val 

CAG GAT GAT CTC AGG GAG GTT GAA AAA TCA ATC TCT AAC C3*A GAA AAG TCT CTC ACT TOO 60 
Gin Asp Asp Leu Arg Glu Val Glu Lys 8er lie Ser Asn Leu Glu Lys Ser Leu Thy Ser 

CTG TCT GAA GTT GTC CTA CAG AAT CGA AGG GGC CTA QA£ TTG TEA TIT CTA AAA GAA GO* 20 
Leu Ser Glu Val Val Leu .Gin Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly 

GGG CTO TGT GCT GCT CTA AAA GAA GAA TGT TGC TIC TAT GCG GAC r * r GGA CTA G BH I80 

Gly Leu Cys Ala Ala Leu Lys Glu blu Cys Cys Phe Tyr A>» Asp His Thr Gly Leu Val 

AGA GAC AGC ATG GCC AAA TTG AGA GAG AGG CTT AAT CAG AGA CAG AAA CTG TIT "GAG TO840 
Arg Asp Ser Met Ala Lys Leu Arg Glu Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu Ser 

ACT CAA GGA TOG TTT GAG GGA CTO TTT AAC AGA TCC CCT TOG TTT ACC ACC TOG ATA TOBOO 
Thr Gin Gly Trp Phe Glu Gly Leu phe Asn Arg Ser Pro Trp phe Ihr Thr L^u He Ser" 

ACC ATT ATO GGA CCC CTC ATT GTA CTC CTA ATO ATT TOG CTC TTC GGA CCC TCC ATT CTB60 
Thr lie Met Gly Pro Leu lie Val beu Leu Met lie Leu Leu Phe Gly £ro Cys Jje Leu^ 

AAT CGA TTA GTT CAA TTT GTT AAA GAC AGG ATC TCA GTA OTC CAG GQT TTA GTC QTO 

Asn Arg Leu Val Gin Phe Val Lys psp Arg He Ser VaX Val Gin Ala Leu V$JL Leu Thr " : 

CAA CAA TAC GAC CAG CTA AAA CCA CTA GAA TAC GAG CCA 1959 
Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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ATX? GCQ COT TCA ACQ CTC TCA AAA fcCC CCT CAA GAT AAA ATT AAC CCQ TGQ AAG CCC TTA60 
Met Ala Arg Ser Thr Leu Ser Lys pro Pro Gin Asp Lys He Asa Pro Trp Lye Pro Leu 

ATA GTC ATG GGA GTC CTG TTA GGA OTA GGG ATG GCA GAG AGC CCC CAT CAG GTC TTT AAT20 
lie Val Met Gly Val Leu Leu Gly Val Gly Met Ala Glu Ser Pro His Gin Val Phe Asn 

GTA ACC TGG AGA GTC ACC AAC CTG KTO ACT GGG COT ACC GCC AAT GCC ACC TCC CTC CTG 80 
Val Thx Trp Arg Val Thr Asn Leu pet Thr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu 

GGA ACT GTA CAA GAT GCC TTC CCA KAA TTA TAT TTT GAT CTA TOT GAT CTG GTC GGA GAG 40 
Gly Thr Val Gin Asp Ala Phe Pro fcys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu 

GA6 TGG GAC CCT TCA GAC CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAC CCC GCA GGG 00 
Glu Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys. Tyr Pro Ala Gly 

AGA CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC CTA AAG TCG GGG60 
Arg Gin Arg Thr Arg Thr Pbe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly 

* 

TOT GGG GGA CCA GGA GAG GGC TAC TOT GOT AAA TGG GGG TOT GAA ACC ACC GGA CAG GCI20 
cys Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lye Trp Gly cys Glu Thr Thr Gly Gin Ala 

TAC TGG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CTT AAG CGC GOT AAC ACC CCC TGQ 80 
Tyr Trp Lys Pro Thr Ser .Ser Trp Asp Leu lie Ser Leu Lys Arg Gly Asn Thr Pro Trp 



TOT 

pya 



Asp Thr Gly eye Ser Lys Val Ala pya Gly Pro Cys Tyr Asp Leu Ser Lys Val Ser Asn 

TCC TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAS 00 
Ser Phe Gin Gly Ala Thr Arg Gly Gly Arg Cys Asn Pro Leu Val Leu Glu Phe Thr Asp 

GCA GGA AAA AAG. GCT AAC TGG GAC GGG CCC AAA TCG TGG GGA CTG AGA CTG TAC CGG ACS 60 
Ala Gly Lys Lys Ala Asn Trp Asp ply Pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr 

GGA ACA GAT CCT ATT ACC ATG TTC tTCC CTG ACC CGG CAG OTC CTT -AAT GTG GGA CCC COS 20 
Gly Thr Asp Pro lie Thr Met Phe per Leu Thr Arg Gin Val Leu Asn Val Gly Pro Arg 



GTC CCC ATA GGG CCC AAC CCA GTA jlTA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG 80 
Val Pro lie Gly Pro Asn Pro Val peu Pro Asp Gin Arg Leu Pro Ser ser Pro He Glu 



ATT GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC ACT TAC CCC CCT TCC ACT ACS 40 
lie Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Tyr Pro Pro Ser Thr Thr 

ACT ACA CCC TCA ACC TCC CCT ACA ACT CCA ACT GTC CCA CAG CCA CCC CCA GOT ACC GAS 00 
Ser Thr Pro Ser Thr Ser Pro Thr Ser Pro Ser Val Pro Gin Pro Pro Pro Gly Thr Glu 

AAT AGG CTG CTA AAC TTA GTA GAC GGA GCC TAC CAA GCC CTC AAC CTC ACC AOT CCT GAG 60 
Asn Arg Leu Leu Asn Leu Val Asp Gly Ala Tyr Gin Ala Leu Asn Leu Thr Ser Pro Asp 

AAA ACC CAA GAG TGC TGG TTQ TOT CTA GTA GCG GGA CCC CCC TAC TAC GAA GGG CTT GOD 20 
Lys Thr Gin Glu Cys Trp Leu Cys Leu Val Ala Gly Pro Pro Tyr Tyr Glu Gly Val Ala 

GTC CTG GOT ACC TAC TCC AAC CAT ACC TCT GCT CCA GCC AAC TGC TCC GTG GCC TCC CAA 80 
Val Leu Gly Thr Tyr ser Asn His Thr Ser Ala Pro Ala Asn cys Ser Val Ala Ser Gin 

CAC AAG TTG ACC CTG TCC GAA GTG ACC GGA CAG GGA CTC TGC AXA GGA GCA GTT CCC AAA40 
His Lys Leu Thr Leu Ber Glu Val Bhr Gly Gin Gly Leu Cys lie Gly Ala Val Pro Lys 

ACA CAT CAG GCC CTA TOT AAT ACC hCC CAG ACA AGC AOT CGA GGG TCC TAT TAT CTA GTS 00 
Thr His Gin Ala Leu Cys Asn Thr phr Gin Tfcr Ser Ser Arg Gly Ser Tyr Tyr Leu Val 
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GCC CCT ACA GOT ACC ATG TOG OCT TGT AOT ACC GGG CTT ACT CCA TGC ATC TCC ACC AOS 60 
Ala Pro Thr Gly Thr Met Trp Ala Cys 8er Thr Gly Lea Thr Pro Cys lie 8er Thr Thr 

ATA CTG AAC CTT ACC ACT GAT TAT TGT GTT CTT GTC GAA CTC TGG CCA AGA GTC ACC TM20 
lie Leu Asn Leu Thr Thr Asp Tyr Cys Val Leu Val Glu Leu Trp Pro Arg Val Thr Tyr 

CAT TCC CCC AGC TAT CTT TAC GGC CTG TOT GAG AGA TCC AAC CGA CAC AAA AGA GAA COG 80 
Els Ser Pro Ser Tyr Val Tyr Gly Leu Phe Glu Arg Ser Asn Arg His Lys Arg Glu Pro 

GTG TOO TTA ACC CTG GCC CTA TTA TTQ GOT GGA CTA ACC ATG GGG GGA ATT GCC GCT 00*40 
Val Ser Leu Thr Leu Ala Leu Leu Leu Gly Gly Leu Thr Mat Gly Gly He Ala Ala Gly 

ATA GGA ACA GGG ACT ACT GCT CTA ATG GCC ACT CAG CAA TTC CAG CAG CTC CAA GCC GOJS00 
He Gly Thr Gly Thr Thr Ala Leu Met Ala Thr Gin Gin Phe Gin Gin Leu Gin M» Ala 

GTA CAG GAT GAT CTC AGG GAG GIT feAA AAA TCA ATC TCT AAC CTA GAA AAG TCT CTC AOS 60 
Val Gin Asp Asp Leu Arg Glu Val plu Lys Ser He Ser Asn Leu Glu Lys Ser Leu Thr 

TCC CTG TCT GAA GTT CTC CTA CAG k&T CGA AGG GGC CTA GAC TTQ TTA TTT CTA AAA GU20 
Ser Leu Ser Glu Val Val Leu Gin pan Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu 

GGA GGG CTG TGT GCT GCT . CTA AAA GAA GAA TGT TCC TTC TAT GCG GAC CAC ACA GGfl CXK80 
Gly Gly Leu Cys Ala Ala Leu Lys Glu Glu Cys Cys Phe Tyr Ala Asp His Thr G ly Leu 

GTG AGA GAC AGC ATG GCC AAA TTG AGA GAG AGG CTT AAT CAG AGA CAG AAA CTG TTT GAG 40 
Val Arg Asp Ser Mat Ala Lys Leu Arg Glu Arg Leu Asn Gjbn Arg Lys Leu Phe Glu 

TCA ACT CAA GGA TGG TTT GAG GGA CTG TTT AAC AGA TCC CCT TGG TIT. ACC ACC TTG AM 00 
Ser Thr Gin Gly Trp Phe Glu Gly Leu Phe Asn Arg Ser Pro Trp Phe Thr Thr Leu He 

TCT ACC ATT ATG GGA CCC CTC ATT GTA CTC CTA ATG ATT TOG CTC TTC GGA CCC TGC ATP 60 
Ser Thr He Met Gly Pro Leu He Val Leu Leu Met He Leu Leu Phe Gly Pro Cys He 

CTT AAT CGA TTA GTT CAA TTT GIT ^AA GAC AGG ATC TCA GTA CTC CAG GCT TTA OTC CTO20 
Leu Asn Arg Leu Val Gin Phe Val Lys Asp Arg He Ser Val Val Gin Ala Leu Val Leu 

ACT CAA CAA TAC CAC CAG CTA AAA CCA CTA GAA TAC GAG CCA 1962 
Thr Gin Gin Tyr His Gin Leu Lys Pro Leu Glu Tyr Glu Pro 
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ATG GCG CGT TCA ACQ CTC TCA AAA CCC CCT CAA GAT AAG ATT AAC CCQ TCG AAQ CCC TTA60 
Met Ala Arg Ser Thr Leu ser Lys Pro Pro Gin Asp Lys lie Asn Pro Trp Lya Pro Leu 

ATA CTC ATG GGA GTC CTG TTA GGA GTA GGG ATG GCA GAG AGC CCC CAG GTC TTT AAT GTA 20 
He Val Met Gly Val Leu Leu Gly |/al Gly Met Ala Glu Ser Pro Gin Val Phe Asn Val 

ACC TGG AGA GTC ACC AAC CTG ATG fcCT GGG CGT ACC GCC AAT GCC ACC ICC CTC CTC GGA 80 
Iftr Trp Arg Val Thr Asn Leu Met (rhr Gly Arg Thr Ala Asn Ala Thr Ser Leu Leu Gly 

ACT OTA CAA GAT GCC TTC CCA AAA fcCA TAT TTT GAT CTA TGT GAT CTG GTC GGA GAG GAG 40 
Thr Val Gin Asp Ala Phe Pro Lys Leu Tyr Phe Asp Leu Cys Asp Leu Val Gly Glu Glu 

TGG GAC CCT TCA GAG CAG GAA CCG TAT GTC GGG TAT GGC TGC AAG TAG CCC GCA GGG AGA 00 
Trp Asp Pro Ser Asp Gin Glu Pro Tyr Val Gly Tyr Gly Cys Lys Tyr Pro Ala Gly Arg 

CAG CGG ACC CGG ACT TTT GAC TTT TAC GTG TGC CCT GGG CAT ACC GTA AAG TCG GGG TOT60 
Gin Arg Tftr Arg Thr Phe Asp Phe Tyr Val Cys Pro Gly His Thr Val Lys Ser Gly Cys 

GGG GGA CCA GGA GAG GGC TAC TGT GGT AAA TOG GGG TGT GAA ACC ACC GGA CAG GCT TAS20 
Gly Gly Pro Gly Glu Gly Tyr Cys Gly Lys Tip Gly Cys Glu Thr Thr Gly Gin Ala Tyr 

TGG AAG CCC ACA TCA TCG TGG GAC CTA ATC TCC CCT AAG CGC GGT AAC ACC CCC TCG GAS 80 
Trp Lys Pro Thr Ser Ser Trp Asp £eu lie Ser Leu Lys Arg Gly Asn Thr Pro Trp Asp 

ACG GGA TGC TCT AAA GTT GCC TGT GGC CCC TGC TAC GAC CTC TCC AAA GTA TCC AAT TC640 
Thr Gly Cys Ser Lys Val Ala Cya Gly Pro Cyn Tyr Asp Leu Ser Lys Val Ser Asn Ser 

TTC CAA GGG GCT ACT CGA GGG GGC AGA TGC AAC CCT CTA GTC CTA GAA TTC ACT GAT GCA 00 
Phe Gin Gly Ala Thr Arg Gly Gly J^rg cys Asn Pro Leu Val Leu Glu Phe Thr Asp Ala 

GGA AAA AAG GCT AAC TCG GAC GGG bCC AAA TCG TGG GGA CTG AQA CTC TAC CGG ACA GGA 60 
Gly Lys Lys Ala Asn Trp Asp Gly pro Lys Ser Trp Gly Leu Arg Leu Tyr Arg Thr Gly 

ACA GAT CCT ATT ACC ATG CTC TCC bTO ACC CGG CAG GTC CTT AAT GTC GGA CCC CGA GttJ20 
Thr Asp pro lie Thr Met Phe Ser pen Tfcr Arg Gin Val Leu Asn Val Gly Pro Arg Val 

CCC ATA GGG CCC AAC CCA GTA TTA CCC GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT 80 
Pro tie Gly Pro Asn Pro Val Leu Pro Asp Gin Arg Leu Pro Ser Ser Pro lie Glu He 

GTA CCG GCT CCA CAG CCA CCT AGC CCC CTC AAT ACC ACT TAC CCC CCT TCC ACT ACC ACT 40 
Val Pro Ala Pro Gin Pro Pro Ser Pro Leu Asn Thr Ser Syr Pro Pro Ser Thr. Thr Ser 

ACA CCC TCA ACC TCC CCT ACA ACT CCA ACT GTC CCA CAG CCA CCC CCA GOT ACC GAA AA900 
Thr Pro Ser Thr Ser Pro Thr Ser Pro Ser Val Pro Gin Pro Pro Pro Gly Tfcr Glu Asn 

AGG CTG CTA AAC TEA GTA GAC GGA GCC TAC CAA GCC CTC AAC CTC ACC ACT CCT GAC AAA 60 
Arg Leu Leu Asn Leu Val Asp Gly fkla Tyr Gin Ala Leu Asn Leu Thr Ser Pro Asp Lys • 

ACC CAA GAG TCC TCG CTG TGT CTA GTA GCG GGA CCC CCC TAC TAC GAA GGG GOT GCC GHD20 
Skr Gin Glu Cys Trp Leu Cys Leu Val Ala Gly Pro Pro Tyr Tyr Glu Gly Val Ala Val 

CTG GGT ACC TAC TCC AAC CAT ACC TCT GCT CCA GCC AAC TGC TCC GTG GCC TCC CAA CA08O 
Leu Gly Thr Tyr Ser Asn His Thr ser Ala Pro Ala Asn Cys Ser Val Ala Ser Gin His 

AAG CTG ACC CTG TCC GAA GTG ACC GGA CAG GGA CTC TGC ATA GGA GCA GTT CCC AAA AOA40 
Lys Leu Thr Leu Ser Glu Val Thr fely Gin Gly Leu Cys lie Gly Ala Val Pro Lys Tlur 

CAT CAG GCC CTA TGT AAT ACC ACC pAG ACA AGC ACT CGA GGG TCC TAT TAT CTA GTT GOG 00 
His Gin Ala Leu Cys Asn Thr Thr Jsln Thr Ser Ber Ar ? Gly Ser Tyr Tyr Leu Val Ala 
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CCT ACA 
Pro Thr 


GOT 
Gly 


ACC 

Thr 


ATG 
Met 


TGG GCT 
Trp Ala 


TGT 
Cys 


ACT 
ser 


ACC 
Thr 


GGG 
Gly 


COT 
Leu 


ACT 
Thr 


CCA TGC ATP TCC ACC ACC A22260 
Pro Cys lie Ser Thr Thr He 


CTG AAC 
Leu Asn 


COT 
Leu 


ACC 
Thr 


ACT 
Ohr 


GAT TAT 
Asp Tyr 


TGT 
Cys 


CTT 
Val 


CTT 
Leu 


GTC 
Val 


GAA 
Glu 


CTC 
Leu 


TGG CCA AGA GTC ACC TAT G&B20 
Trp Pro Arg Val Thr Tyr His 


TCC CCC 
Ser Pro 


AGC 
Ber 


TAT 
Tyr 


GOT 
Val 


TAG GGC 
Tyr Gly 


CTG 
Leu 


OTT 
Phe 


GAG 
Glu 


AGA 
Arg 


TCC 
Ser 


AAC 
Asn 


CGA CAC AAA AGA GAA CCG CT380 
Arg His Lys Arg Glu Pro Val 


TCG TTA 

ser Leu 


ACC 

Thr 


CTG 
Leu 


GCC 
Ala 


CTA TTA 
Leu Leu 


TTG 
Leu 


GOT 
Gly 


GGA 
Gly 


CTA 
Leu 


ACC 
Thr 


ATG 
Met 


GGG GGA ATT GCC GCT GGA ATA 40 
Gly Gly He Ala Ala Gly He 


GGA ACA. 
Gly Thr 


GGG 
Gly 


ACT 
Thr 


ACT 
Thr 


GCT CTA 
Ala Leu 


ATG 
Met 


pec 
Ala 


ACT 
Thr 


CAG 


CAA 
Gin 


OTC 
Phe 


CAG CAG CTC CAA GCC GCA GT500 
m-n airy jjexL <3ln Ala Ala Val 


CAG GAT 
Gin Asp 


GAT 
Asp 


CTC 
Leu 


AGG 
Arg 


GAG GTT 
Glu Val 


GAA 
Glu 


kAA 

Lvs 


TCA 
Ser 


atc 
He 


TCT 
Ser 


AAC 
Asn 


CTA GAA AAG TCT CTC ACT TO06O 
Leu Glu Lys Ser Leu Thr Ser 


CTG TCT 
Leu Ser 


GAA 
Glu 


OTT 
Val 


GTC 
Val 


CTA CAG 
Leu Gin 


AAT 
Asn 


c 


AGG 
Arg 


GGC 
Gly 


CTA 
Leu. 


GAC 
Asn 


TTG TTA TOT CTA AAA GAA GGS20 

Leu Lao Pha Leu Lva Glu Glv 


GGG CTG 
Gly .Leu 


TGT 
Cys 


GCT 
Ala. 


GCT 
Ala 


CTA AAA 
Leu Lys 


GAA 
Glu 


GAA 
GlU 


TGT 
Cys 


TGC 
Cys 


TTC 
Phe 


TAT 
Tyr 


GCG GAC CAC ACA GGA CTA GT08O 
Ala Asp His Thr Gly Leu Val 


AOA GAC 
Arg Asp 


AGC 
Ser 


ATG 
Met: 


GCC 
Ala 


AAA TTG 
Lys Leu 


AGA 
Arg 


GAG 
GlU 


AGG 


COT 

Leu 


AAT 
Asn 


CAG 
gin 


AdA CAG AAA CTG TTT GAG TM40 
Arg Gin £ys Leu Pfee G^Lu Ser 


ACT CAA 
Thr Gin 


GGA 
Gly 


TGG 

Trp 


TTT 
Phe 


GAG GGA 
Glu Gly 


CTG 
Leu 


TTT 
Phe 


AAC 
Asn 


AGA 
Arg 


TCC 
Ser 


CCT 
Pro 


TGG TOT ACC ACC TTG ATA TUB 00 
Tip phe Thr Thr Leu He Ser 


ACC ATT 
Thr Ha 


ATG 
Met 


GGA 
Gly 


CCC 
Pro 


CTC AOT 
Leu He 


GTA 
Val 


CTC 
Leu 


CTA 
Leu 


ATG 
Met 


ATT 
lie 


TTG 
Leu 


CTC TTC GGA CCC TGC AOT CT860 
Leu Phe Gly Pro Cys He Leu 


AAT CGA 
Asn Arg 


TTA 
Leu 


GTT 
Val 


CAA 
Gin 


OTT GTT 
Phe Val 


AAA 

Lys 


GAC 
Asp 


AGG 
Arg 


ATC 
He 


TCA 
Ser 


GTA 
Val 


CTC CAG GCT TTA GTC CTG A03) 20 
Val Gin Ala Leu Val Leu Thr 


CAA CAA 
Gin Gin 


TAG 


CAC 

Hie 


CAG 
Gin 


CTA AAA 
Leu Lys 


CCA 

Pro 


CTA 
pen 


GAA 
GlU 


TAG 
Tyr 


GAG 

Glu 


CCA 
Pro 


1959 
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ATG QCQ CGT TCA ACQ CTC .TCA AAA CCC CTT AAA AAT AAG CTT AAC CCG CGA GGC CCC CTA60 
Met Ala Arg Ser Thr Leu Ser Lys Pro Leu Lys Asn Lys Val Asa Pro Arg Gly Pro Leu 

ATC CCC TEA ATT CTT CTO ATG CTC AGA GGG OTC ACT ACT GCG GCC GAG CCG GCC ATG GCU20 
He Pro Leu He Leu Leu Met Leu krg Gly Val Ber Thr Ala Ala Gin Pro Ala Met Ala 

CAG GTT CAG CTT CAG CAG TCT GGA GCT GAA CTG GTA AAG CCT GGG GCT TCA OTG AAG TTO80 
Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Lys Pro Gly Ala Ser Val Lys Leu 

TCC TGC AAG GCT TCT GGC TAG ACC IrTC ACA AGC TAT GAT ATA AAC TGG GTG AGG CAG AGG40 
Ser cya Lys Ala Ser Gly Tyr Thr £he Thr Ser Tyr Asp He Asn Trp Val Arg Gin Arg 

CCT GAA CAG GGA CTT GAG TGG ATT feGA TGG ATT TTT CCT GGA GAT GGT ACT ACT AAG TAG 00 
Pro Glu Gin Gly Leu Glu Trp He f3ly Trp He Phe Pro Gly Asp Gly Ser Thr Lys Tyr 

AAT GAG AAG TTC AAG GGC AAG GCC ^CA CTG ACT ACA GAC AAA TCC TCC AGC ACA GCC TAG 60 
Asn Glu Lys Phe Lys Gly Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser Thr Ala Tyr 

ATO CAG CTC AGC AGG CTG ACA TCT GAA GAC TCT GC!T ATC- TAT TTC TGT GCA AGA ATG GCG20 
Mat Gin Leu Ser Arg Leu Thr Ser Glu Asp Ser Ala lie Tyr Phe Cya Ala Arg Met Ala 

f 

GAC TAT GOT CAC TAG GGC GCT GTG GAC TAG TGG GOT CAA GGG ACC ACQ GTC ACC GTC TC880 
Asp Tyr Gly His Tyr Gly Ala Val Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr -Val Ser 

TCA GOT GGA GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC GGA TCG GAC ATC CAG ATS 40 
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Asp I^e Gin Jfct 

ACT CAG TCT CCA TCC TCC CTG TCT GCC TCT CTG GGA GAC AGA OTC ACC ATC ACT TGC AG* 00 
Thr Gin Ser Pro Ser Ser Leu Ser ^la Ser Leu Gly Asp Arg Val Thr He Ser Cys Arg 

GCA ACT CAG GAC ATT AGC AAT TAT TEA AAC TGG TAT CAG CAG ACA CCA GAT GGA ACT GTS 60 
Ala Ser Gin Asp He Ser Asn Tyr Xeu Asn Trp Tyr Gin Gin Thr Pro Asp Gly Thr Val 

AAA CTC CTG ATC TAG TAG ACA TCA AGA TTA CAC TCA GGA GTC CCA TCA AGG TTC APT GGG20 
Lys Leu Leu He Tyr Tyr Thr Ser K*g ***xi His Ser Gly Val Pro Ser Arg Phe Ser Gly 

ACT GGG TCT GGA ACA GAT TAT TCT brc ACC ATT AGC AAC CTG GAG CAA GAA GAT ATT GCC80 
Ser Gly Ser Gly Thr Asp Tyr Ser ten Thr He Ser Asn Leu Glu Gjn Glu Asp £le A*Ia v " 

ACT TAG TTT TGC CAA CAG GOT TAT kcG CTT CCG TGG ACQ TTC GGT GGA GGC ACC AAG CT840 
Thr Tyr Phe Cys Gin Gin Gly Tyr khr Leu Pro Trp Thr Phe Gly Gly Gly Thr Lys Leu 

GAG ATC AAA CGG GCG GCC GCA GGA CCC CGA GTC CCC ATA GGG CCC AAC CCA OTA TTA CCG 00 
Glu He Lys Arg Ala Ala Ala Gly Pro Arg Val Pro He Gly Pro Asn Pro Val Leu Pro 

GAC CAA AGA CTC CCT TCC TCA CCA ATA GAG ATT GTA CCG G&t CCA CAG CCA CCT AGC CCG 60 
Asp Gin Arg Leu Pro Ser Ser Pro He Glu He Val Pro Ala Pro Gin ?ro Pro Ser Pro 

CTC AAT ACC ACT TAG CCC CCT TCC ACT ACC ACT ACA GCC TCA ACC TCC CCT ACA ACT COfi^O 
Leu Asn Thr Ser Tyr Pro. Pro Ser Thr Thr Ser Thr Pro Ser Thr Ser Pro T^r Ser Pro * 

ACT OTC CCA CAG CCA CCC CCA GGA ACT GGA GAT AGA CTA CTA GCT CTA GTC AAA GGA GO08O . 
Ser Val Pro Gin Pro Pro Pro Gly |rhr Gly Asp Arg Leu Leu Ala Leu Val Lys Gly Ala 

TAT CAG GCG CTT AAC CTC ACC AAT CCC GAC AAG ACC CAA GAA TGT TGG CTG TGC TTA O1G40 
Tyr Gin Ala Leu Asn Leu Thr Asn Pro Asp Lys Thr Gin Glu Cys Trp Leu Cys Leu Val 

FIG. 30 



WO 00/71578 



PCT/EP00/04534 



49/49 



34.1 -A Sequence 



10 20 30 40 50 60 

123 456 789 ? *™ 234 567 890 123 456 789 012 345 ; 678 901 234 '567 890 " - 

TCG GGA CCT CCT TAT TAC GAA GGA GTA GCQ GTC OTQ QGC ACT TAT ACC AAT CAT TCC AOflOO 
ser oly Pro Pro Tyr Tyr Qlu Qly Val Ala Val Val Gly Thr Tyr Thr Asa His 8er Thr 

OCT CCG GCC AAC TCT ACG GCC ACT TCC CAA CAT AAG CTT ACC CTA TCP GAA GIG ACA GG06O 
Ala Pro Ala Asn cys Thr Ala Thr Ser Gin His Lys Leu Thr Leu Ser Glu Val Thr Gly 

CAG GGC CTA TGC ATG GGG GCA GTA CCT AAA ACT CAC CAG GCC OTA TOT AAC ACC ACC GA&20 
Glu Gly Leu Cya Met Gly Ala Val Pro Lys Thr Bis Gin Ala Leu Cys Asn Thr Thr Gin 

AGC GCC GGC TCA GGA TCC TAG TAC CTT GCA GCA CCC GCC GGA ACA ATG TGG GCT TGC AGO 80 
Ser Ala Gly ser Gly ser Tyr Tyr Leu Ala Ala Pro Ala Gly Thr Met Trp Ala Cys Ser 

ACT GGA TTG ACT CCC TGC TTG TCC JfcCC ACG GTG CTC AAT CTA ACC ACA GAT TAT TGT GTA 40 
Thr Gly Leu Thr Pro Cys Leu Ser Thr Thr Val Leu Asn Leu Thr Thr Asp Tyr Cys Val 

TTA GTT GAA CTC TGG CCC AGA GTA ATT TAC CAC TCC CCC GAT TAT ATG TAT GOT CAG CD 00 
Leu Val Glu Leu Trp Pro Arg Val lie Tyr Bis Ser Pro Asp Tyr Met Tyr Gly Gin Leu 

GAA CAG COT ACC AAA TAT AAA AGA hAG CCA GTA TCA TTG ACC CTG GCC CTT CTA CTA GOS60 
Glu Gin Arg Thr Lys Tyr Lys Arg plu Pro Val Ser ten Thr Leu Ala Leu Leu Leu Gly 

GGA TTA ACC ATG GGA GGG ATT GCA bCT GGA ATA GGG ACG GGG ACC ACT GCC TTA ATT Atf 20 
Gly Leu Thr Met Gly Gly lie Ala Ala Gly He Gly Thr Gly Thr Thr Ala Leu "Tie Lys 

ACC CAG CAG TTT GAG CAG CTT CAT bCC GCT ATC CAG ACA GAC CTC AAC GAA GTC GAA AA08O 
Thr Gin Gin Phe Glu Gin Leu His Ala Ala lie Gin- Thr Asp Leu Asn Glu Val Glu Lys 

TCA ATT ACC AAC CTA GAA AAG TCA CTG ACC TCG TTG TCT GAA GTA GTC CTA CAG AAC CO04O 
Ser lie Thr Asn Leu Glu Lys Ser Leu Thr Ser Leu ser Glu Val Val Leu Gin Asn Arg 

AGA GGC CTA GAT TTG CTA TTC CTA AAG GAG GGA GOT CTC TGC GCA GCC CTA AAA GAA GAB 00 
Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly Gly Leu Cys Ala Ala pen Lys GjLu GJ.u 

TGT TGT TTT TAT GCA GAC CAC ACG GGG CTA GTG AGA GAC AGC ATG GCC AAA TTA AGA QB*§0 
Cys Cys Phe Tyr Ala Asp Bis Thr Gly Leu Val Arg Asp Ser Met Ala Lys Leu Arg Glu 

AGG CTT AAT CAG AGA CAA AAA CTA jrTT GAG ACA GGC CAA GGA TGG TTC GAA GGG CTG TT^O 
Arg Leu Asn Gin Arg Gin Lys Leu Phe Glu Thr Qly Gin Gly Trp Phe Glu Gly Leu Phe 

AAT AGA TCC CCC TGG TTT ACC ACC TTA ATC TCC ACC ATC ATG GGA CCT CTA ATA GTA CTG 80 
Asn Arg Ser Pro Trp Phe Thr Thr Xeu lie Ser Thr lie Met Gly Pro Leu He Val Leu 

TTA CTG ATC TTA CTC TTT GGA CCT TCC ATT CTC AAT CGA TTG GTC CAA TTT GTT. AAA G204O 
Leu Leu lie Leu Leu Phe Gly Pro Pys lie Leu Asn Arg Leu Val Gin Phe Val Lys Asp 

AGG ATC TCA GTG GTC CAG GCT CTG bTT TTG ACT CAG CAA TAT CAC CAG CTA J&A CCC A$£9Q 
Arg lie ser Val Val Gin Ala Leu pal Leu Thr Gin Gin Tyr Bis Gjj^ Leu Lys Pro lie 

GAG TAC GAG CCA TGA 2115 
Glu Tyr Glu Pro . . . 
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